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Compositions and Methods for Polynucleotide Delivery 

Description 

Field of the Invention 

This invention relates to compositions and methods for increasing the 
5 uptake of polynucleotides into cells. Specifically, the invention relates to 

vectors, targeting ligands, and polycationic agents. The polycationic agents are 
capable of (1) increasing the frequency of uptake of polynucleotides into acelU 
(2) condensing polynucleotides; and (3) inhibiting serum and/or nuclease 
degradation of polynucleotides. 

0 Backeround of the Invention 

Polycations» such as polylysine, have been used to facilitate delivery of 
nucleic acids to cell interior. Both in vitro and in vivo applications have taken 
advantage of this property. See, for example, Gao el al., 1996, Biochem. 
35:1027-1036. 

5 Polynucleotides, typically DNA, may be taken into a cell by a receptor- 

mediated endocy tosis pathway, a cellular mechanism which internalizes specific 
macromolecules. In general, complexes designed to be delivered in this fashion 
contain nucleic acids encoding the gene of interest and a polycationic agent, 
which acts as a DNA-binding carrier and both neutralizes the charge on the 

D nucleic acids and condenses it. 

Condensation facilitates entry of nucleic acids into cell vesicle systems 
by simulating a macromolecular structure. For example, polylysine condenses 
DNA into a toroid or doughnut-like structure. (Wagner et al., 1991, Proc. Natl . 
Acad. Sci. 88:4255-4259). 

5 Polycations previously utilized for nucleic acid delivery to cell interiors 

include polylysine, protamines* histones, spermine, spermidine, polyomithine, 
polyargnine, and putrescine. 

All publications mentioned herein are incorporated herein by reference 
for the purpose of disclosing and describing features of the invention for which 

3 the publications are cited in connection with. 
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Summary of the Invention 

An embodiment of the invention is a vector for expression of 
polypeptides. The vector of the instant invention comprises: (i) an Epstein Barr 
Virus (EBV) origin of replication; (ii) a polynucleotide encoding an EBV origin 
binding protein; (iii) an enhancer; (iv) a promoter; and (v) a terminator. 
Polynucleotides encoding a desired polypeptide, such as erythropoietin or leptin 
can be insened into the vector. Also, ribozyme and antisense polynucleotides 
can also be inserted into the vector. 

One embodiment of the invention is a composition capable of targeting a 
polynucleotide to a specific cell type. The composition comprises: (i) a 
lipoprotein; (ii) a polynucleotide binding molecule; and (iii) a polynucleotide. 

Another embodiment of the invention is a method of increasing the 
frequency of uptake of polynucleotides into a cell by contacting a cell with a 
composition comprising: (i) a lipoprotein, (ii) a polynucleotide binding 
molecule; and (iii) a polynucleotide. 

Yet another embodiment of the invention is a method of increasing the 
frequency of uptake of polynucleotides into a specific cell type by contacting a 
population of cells with a composition comprising (i) a lipoprotein, (ii) a 
polynucleotide binding molecule; and (iii) a polynucleotide. 

One embodiment of the invention is a polycationic agent exhibiting a net 
positive electrical charge at physiological pH with the following formula: 



where Ta and Tc are terminating groups. A preferred subset of these compounds 
is the set where is hydrogen. Even more preferred are polymers comprising at 



hydrogen and R^ is not hydrogen, also referred to as poly N-substituted glycines 
or "poly NSGs." 

Another embodiment is a neutral polymer exhibiting no net positive or 
negative electrical charge at physiological pH with the following formula: 




least one unnatural amino acid. Also preferred are polymers where and are 
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where Ta and Tc are terminating groups. A preferred subset of these compounds 
is the set where R, is hydrogen. Even more preferred are polymers comprising at 
least one unnatural amino acid. Also preferred are polymers where Rj and Rj are 
hydrogen and R, is not hydrogen, also referred to as poly N~substituted glycines 
or ^'poly NSGs." 

The instant polycationic agents and neutral polymers are capable of 
neutralizing the electrical charge of nucleic acids. A subset of these compounds 
are capable of (1) condensing the structure of polynucleotides and/ or (2) 
protecting polynucleotides from semm and/or nuclease degradation. 

Yet another embodiment of the invention are polycationic agents and 
neutral polymers that (1) target binding of nucleic acids to cell surfaces, (2) 
trigger cell membrane destabilization; (3) exhibit endosome buffering capacity; 
(4) trigger endocytosis; (5) help trigger the release of polynucleotide/lipid 
complexes from endosomes or (6) nuclear tropism. 

Another embodiment of the invention is a composition comprising a 
polynucleotide of interest and an effective amount of the polycationic agent to 
neutralize the charge of the polynucleotide. Optionally, the composition includes 
a ligand which directs the complex to particular cells expressing a ligand-binding 
partner, and/or an endosomolytic agent, which serves to cause dismption of the 
endosome containing the complex. 

Another embodiment of the invention is a method of condensing nucleic 
acids by providing an effective amount of the polycationic agents or neutral 
polymers of the invention and contacting the agent with the desired 
polynucleotides. 

Also an embodiment of the invention is a method of inhibiting serum 
and/or nuclease degradation of nucleic acids by providing an effective amount of 
the the polycationic agents or neutral polymers of the inventions and contacting 
the agent with the desired nucleic acids. 



3 



BNSCXDCID: <WO 9806437A2_I_> 



wo 98/06437 PCT/US97/14465 



Brief Description of the Drawings 

Figure 1 is a schematic of a two-step monomer assembly reaction 
scheme. 

Figure 2 is a schematic of a three-step monomer assembly reaction 
5 scheme. 

Figure 3 is a plasmid map of vector pCMVKmlTR-EPI. 

Figure 4 is a plasmid map of vector CMVkm2. 

Figure 5 is a plasmid map of vector pCMV-KM-cmEPO. 

Figure 6 is a plasmid map of vector CMVKmLeptinWt. 
10 Figure 7 A illustrates transfection efficiencies for a diverse set of 

polycationic agents. The polycatonic agents were formulated with DNA at a 2: 1, 
+ to - charge ratio and added to either HT1080 (solid bar) or COS (stippled bar) 
in the presence of 10% serum. Luciferase activity was analyzed 48 hours post- 
transfection. Total cell protein was measured using a Pierce BCA assay and 
15 luciferase activity was normalized against total cell protein. 

Figure 7B illustrates the effect of oligomer length on transfection 
efficiency for polycationic agents having different numbers of the same repeating 
trimer motif. For both A and B each data point represents the average of 2 
experiments. 

20 Figure 8(A-C) shows RZ 145-1 peptoid-mediated transfection and 

transfection mediated by commercially available cationic liposome preparations. 
RZ 145-1 or the indicated lipid was formulated and added to cells in the presence 
(solid bar) or absence (stippled bar) of 10% serum. Luciferase and total cell 
protein activity were measured 48 hours after initial transfection. Cells lines are 

25 (Figure 8A) 293 human embryonic kidney cells, (Figure 8B) HT1080 human 

fibrosarcoma cells/and (Figure 8C) NIH03T3 mouse fibroblast cells. Each data 
point represents the average + strandard error of the mean of three transfections. 

Figure 9 illustrates the effect of chloroquine on transfection with RZ145- 
1 in (A) 293 cells and (B) HT1080 cells. The cells were transfected in the 

30 presence (black bar) or absence (stippled bar) of 100 uM chloroquine. Cells 
were lysed 48 hours post transfection and luciferase activity and total protein 
content were measured. 
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Detailed Description 
Definitions 

"Lipoproteins*' refers to polypeptides that are associated non-covalently 
with lipids in the bloodstream and are capable of binding to cellular receptors. 
5 Preferably, lipoproteins are those involved with transport and storage of lipids. 
Such proteins include, for example, chylomicrons, low density lipoprotein 
(LDL), very low density lipoprotein (VLDL), intermediate density lipoprotein 
(IDL), and high density lipoprotein (HDL). Also, included in the term are 
mutants, fragments, or fusions of the naturally occurring lipoproteins. Also, 
to modifications of naturally occurring lipoproteins can be used, such as acetylated 
LDL. 

Mutants^ fragments, fusions, or modifications of the naturally occurring 
lipoproteins are amino acid sequences that exhibit substantial sequence identity 
to naturally occurring lipoproteins or a fragment thereof. These polypeptides 

15 will retain more than about 80% amino acid identity; more typically, more than 
about 85%; even more typically, at least 90%. Preferably, these polypeptides 
will exhibit more than about 92% amino acid sequence identity with naturally 
occurring lipoproteins or fragment thereof; more preferably, more than about 
94%; even more preferably, more than about 96%; even more preferably, more 

20 than about 98%; even more preferably, more than about 99%. All of these 
polypeptides exhibit receptor binding properties of naturally occurring 
lipoproteins. Usually, such polypeptides exhibit at least about 20% receptor 
binding of naturally occurring lipoproteins. More typically, the polypeptides 
exhibit at least about 40%, even more typically the polypeptides exhibit at least 

25 about 60%; even more typically, at least about 70%; even more typically, at least 
about 80%; even more typically, at least about 85%; even more typically, at least 
about 90%; even more typically, at least about 95% receptor binding of the 
naturally occurring lipoproteins. 

"Polynucleotide binding molecule'* refers to those compounds that 

30 associate with polynucleotides^ and the association is not sequence specific. For 
example, such molecules can ( 1 ) aid in neutralizing the electrical charge of 
polynucleotide, or (2) facilitate condensation of nucleotides, or (3) inhibit serum 
or nuclease degradation. 

5 
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"Polycationic agent" refers generally to a polymer comprising positively- 
charged single units, although some non-positively charged units may be present 
in the polymer. The instant agents exhibit a net positive charge under 
physiologically relevant pH. Such agents are capable of neutralizing the charge 
5 of nucleic acids and can exhibit additional properties, such as condensation 

and/or serum protection of nucleic acids. Preferably, the agents comprises both 
amino acids and NSGs as monomeric units; also, preferred are agents comprising 
NSGs as monomeric units. 

"Physiologically relevant pH" varies somewhat between in vitro and in 
10 vivo applications. Typically, the physiological pH is at least 5.5; more typically, 
at least 6.0; even more typically, at least 6.5. Usually, physiologically relevant 
pH is no more than 8.5; more usually, no more than 8.0; even more usually, no 
more than 7.5. 

"Polynucleotide" or "nucleic acid" refers to DNA, RNA, analogues 

15 thereof, peptide-nucleic acids, and DNA or RNA with non-phosphate containing 
nucleotides. Additionally, these nucleic acids may be single-stranded, double- 
stranded, or chimeric single- or double-stranded molecules. 

The term "oligomer*' includes polymers such as poly NSGs, prodiiced by 
the submonomer process described herein and also in Zuckermann et aJ., supra . 

20 includes polymers, copolymers, and interpolymers of any length. More 
specifically, oligomers may comprise a single repeating monomer, two 
alternating monomer units, two or more monomer units randomly and/or 
deliberately spaced relative to each other. Regardless of the type of polyamide 
produced, the polyamide of the invention may be produced by the same general 

25 procedure which includes repeating a two-step or three step cycle wherein a new 
monomer unit is added in each cycle until an oligomer of desired length is 
obtained. The oligomer is preferably 2-100 moiiomers, more preferably 2-50, or 
18-28 monomers or 24 to 48 monomers in length. 

The term "frequency of uptake of polynucleotides into a cell" refers to an 

30 increase in the amount of polynucleotides actually taken up by a cell relative to 
the amount actually administered to the cell. The frequency of uptake of 
polynucleotides into a cell is increased if it is greater than the frequency of 
uptake of naked polynucleotides. For example, using in vitro transfection 
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methods, uptake of naked polynucleotides into mammalian cells is not usually 
detectable over background. Some frequency of uptake, however, can be 
detected when naked polynucleotides are delivered in vivo. The frequency of 
uptake in vivo and in vitro depends on the tissue type. The frequency of uptake 
5 can be measured by known methods for delecting the presence of 

polynucleotides, such as Northern, Southern, or Polymerase Chain Reaction 
(PCR) techniques. 

Usually, a composition or compound is capable of increasing the 
frequency of polynucleotide uptake into a cell if it induces a frequency of uptake 
10 that is at least 10% greater than the frequency of naked polynucleotide uptake; 
more usually, at least 15% greater; even more usually, 20% greater; even more 
usually, at least 30%; and up to 40% to 100% greater, and even 1,000% and 
10,000% greater. 

"Naked polynucleotides*' refers to polynucleotides that are substantially 
15 free from any delivery vehicle that can act to facilitate entry into the celL For 

example, polynucleotides are naked when free from any material which promotes 
transfection, such as liposomal formulations, charged lipids, such as Lipofectin® 
or precipitating agents such as CajCPO^),. 

"Effective amount to increase the frequency of polynucleotide uptake into 
20 cells'* refers to an amount that induces a frequency of polynucleotide uptake into 
a cell that is at least 10% greater than the frequency of naked polynucleotide 
uptake; more usually, at least 15% greater; even more usually, 20% greater; even 
more usually, at least 30%; even more usually, at least 40%. 

"Effective amount to neutralize nucleic acids" refers to the amount used 
25 to neutralize at least 10% of the electrical charge of the nucleic acid composition; 
more preferably; the amount refers to the amount used to neutralize at least 40%; 
even more preferably, the amount to neutralize 50% of the electrical charge; even 
more preferably, the amount to neutralize 60% of the electrical charge; even 
more preferably, the amount to neutralize 70% of the electrical charge; even 
more preferably, the amount to neutralize 80% of the electrical charge; and most 
preferably, at least 90% of the electrical charge of the nucleic acid composition 
of interest. 
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'^Condensation of nucleic acids" occurs when the polycationic agent thai 
is combined with nucleic acids, neutralizes the eiectricaJ charge of the nucleic 
acids and causes it to assume a reduced structure relative to uncomplexed nucleic 
acids. Preferably, condensation reduces the structure of nucleic acids to a size 
5 that can be internalized by structures present on cell surface membranes. 
Condensation can be measured by determining the charge of the nucleic 
acid/polycationic agent by gel electrophoresis, for example. Alternatively, an 
effective amount to condense nucleic acids can also be measured by the final size 
of the polycationic agent/nucleic acid complex. 

^0 "Effective amount to inhibit serum or nuclease degradation of nucleic 

acids" refers to the amount used to increase the half-life of the polynucleotide 
when exposed to serum and/or nucleases by at least 5 minutes as compared the 
uncomplexed nucleic acids; more preferably, the amount used to inhibit 
degradation by at least 10 minutes; even more preferably, the amount used to 

15 inhibit degradation by at least 30 minutes; even more preferably, the amount 
used to inhibit degradation by at least 45 minutes; even more preferably, the 
amount used to inhibit degradation by at least 60 minutes; even more preferably, 
the amount used to inhibit degradation by at least 90 minutes; and more 
preferably, the amount used to inhibit degradation by at least 120 minutes. 

20 A composition containing A is "substantially free of B when at least 

85% by weight of the total A+B in the composition is A. Preferably, A 
comprises at least about 90% by weight of the total of A+B in the composition, 
more preferably at least about 95% or even 99% by weight. 

"Immunogenicity" refers to the ability of a given molecule or a 

25 determinant thereof to induce the generation of antibodies with binding capacity 
to the molecule upon administration in vivo, to induce a cytotoxic response, 
activate the complement system, induce allergic reactions, and the like. An 
inunune response may be measured by assays that determine the level of specific 
antibodies in semm, by assays that determine the presence of a serum component 

30 that inactivates the polycationic agent/nucleic acid complex or conjugated gene 
delivery vehicle, or by other assays that measure a specific component or activity 
of the immune system. As discussed in more detail below, low immunogencity 
may be established by these assays. The terms "low inununogenicity," **reduced 

2 
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immunogenicity/* "lowered immunogenicity" and similar terms are intended to 
be equivalent terms. 

An "origin of replication" is a polynucleotide sequence that initiates and 
regulates replication of polynucleotides, such as an expression vector. The origin 
5 of replication behaves as an autonomous unit of polynucleotide replication 

within a cell, capable of replication under its own control. With certain origins 
of replication, an expression vector can be reproduced at a high copy number in 
the presence of the appropriate proteins within the cell. Examples of origins are 
the 2^1 and autonomously replicating sequences, which are effective in yeast; and 
10 the viral T-antigen, effective in COS-7 cells. 

General Methods and Detailed Description 
POLYNUCLEOTIDES 

Polynucleotides used in the instant invention can be used to express 
desired polypeptides, or can be, in themselves, therapeutic, such as ribozymes or 

15 antisense polynucleotides. Such polynucleotides can be used in in vitro ^ ex vivo^ 
and in vivo applications. 

Also, the polynucleotides of the invention can be vectors that express 
polypeptides, ribozymes, or antisense molecules. Vectors contain at least a 
promoter to initiate transcription operably linked to the coding region, ribozyme 

20 or antisense molecule. Other components that can be included in the vector are, 
for example: (1) a terminator sequence; (2) a sequence encoding a leader peptide 
to direct secretion; (3) a selectable marker; and (4) an origin of replication. 
An orgin of replication is not necessary. The polynucleotides to be delivered can 
be either replicating or non-replicating. Other components can be added as 

25 desired and convenient. 

The pbljoiucleotides and methods of the invention can be utilized with 
any type of host cell. The choice of promoter, terminator, and other optional 
elements of an expression vector will depend on the host cell chosen. The 
invention is not dependent on the host cell selected. Convenience and the 

30 desired level of protein expression will dictate the optimal host cell. A variety of 
hosts for expression are known in the art and available from the American Type 
Culture Collection (ATCC) (Rockville, Maryland, U.S.A.). Suitable bacterial 
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hosts suitable include, without limitation: Campylobacter, Bacillus, Escherichia, 
Lactobacillus, Pseudomonas, Staphylococcus, and Streptococcus. Yeast hosts 
from the following genera may be utilized: Candida, Hansenula, 
KluyveromyceSy Pichia, Saccharomyces, Schizosaccharomyces, and Yarrowia. 
5 Aedes aegyptij Bombyx mori, Drosophila melanogaster, and Spodoptera 

frugiperda (PCT Patent Publication No. WO 89/046699; Carbonell et al., 1985, 
J. ViroL 56:153: Wright. 1986. Nature 321:718: Smith etal., 1983, Moi. Cell . 
BioL 3:2156; and see generally, Fraser et al., 1989, In Vitro CelL Dev . Biol. 
25:225). 

10 Useful mammalian cell types for in vitro applications include for 

example, those cell lines available from the American Type Culture Collection 
(ATCC), Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidney 
(BHK) cells, monkey kidney cells (CpS), human hepatocellular carcinoma cells 
(e.g.. Hep G2), human embryonic kidney cells, baby hamster kidney cells, mouse 

15 Sertoli cells, canine kidney cells, buffalo rat liver cells, human lung cells, human 
liver cells, mouse mammary tumor cells, as well as others. 

Useful cell types for in vivo or ex vivo applications include, without 
limitation, any tissue type, such as muscle, skin, brain, lung, liter, spleen, blood, 
bone marrow, thymus, heart, lymph, bone, cartilage, pancreas, kidney, gall 

20 bladder, stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, 
gland,.and connective tissue. 

A. In vitro and Ex vivo Vectors 

The polynucleotides encoding the desired polypeptides or ribozymes, or 
antisense polynucleotides can be transcribed and/or translated using the 

25 following promoters and enhancers as examples. The examples include, without 
limitation: the 422(aP2) gene and the stearoyl-CoA desaturase 1 (SCDl) gene, 
which contains suiuble adipocyte-specific promoters, as described in Christy et 
al„ 1989, Genes Dev . 1: 1323-1335. Synthetic non-natural promoters or hybrid 
promoters can also be used herein. For example, a T7T7/T7 promoter can be 

30 constructed and used, in accordance with Chen et al., 1994, Nucleic Acids Res . 
22:21 14-2120, where the T7 polymerase is under the regulatory control of its 
own promoter and drives the transcription of a polynucleotide sequence, which is 

10 
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placed under the control of another T7 promoter. The primary determinant for 
the fat-specific expression is an enhancer located at about >5 kb upstream of the 
transcriptional start site, as described in Ross et aJ., 1990, Proc . Nad . Acad. Sci . 
USA-87;9590-9594 and Graves et al., 1991 , Genes Dev . 5:428-437. Also 
5 suitable for use herein is the gene for the CCAAT/enhancer-binding protein 
C/EBPa, which is highly expressed when 3T3-L1 adioblast commit to the 
differentiation pathway and in mature post-mitotic adipocytes, as described in 
Birkenmeier et al., 1989. Gene Dev . 3:1 146-1 156. The recently isolated 
transcription factor PPARyZ, expressed exclusively in adipocyte tissues, as 

10 described in Tontonoz et al., 1994, Cell 79:1147-1 156, can also be used herein. 

Typical prompters for mammalian cell expression include the S V40 early 
promoter, the CMV promoter, the mouse mammary tumor virus LTR promoter, the 
adenovirus major late promoter (Ad MLP), and the herpes simplex virus promoter, among 
others. Other non-viral promoters, such as a promoter derived from the murine 

15 metallothionein gene, will also find use in mammalian constructs. Expression may be 
either constitutive or regulated (inducible), depending on the promoter. Typically, 
transcription temiination and polyadenylation sequences will also be present, located 3' to 
the translation stop codon. Preferably, a sequence for optimization of initiation of 
translation, located 5* to the coding sequence, is also present. Examples of transcription 

20 temiinator/poiyadenylation signals include those derived from S V40, as described in 

Sambrook et al., 1989, "Molecular Cloning, A Laboratory Manual," second edition. Gold 
Spring Harbor Press, Cold Spring Harbor, New York. Introns, containing splice donor 
and acceptor sites, may also be designed into the constructs of the present invention. 
Enhancer elements can also be used herein to increase expression levels of the 

25 mammalian constructs. Examples include the S V40 early gene enhancer^ as described in 
Dijkema et al., 1985, EMBO J. 4:761 , and the enhancer/promoter derived from the long 
terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman etal., 1982b, 
Proc . Natl. Acad . Sci. USA 79:6777, and human cytomegalovirus, as described in Boshart 
et al., 1985, Cell 41:521. A leader sequence can also be present which includes a 

30 sequence encoding a signal peptide, to provide for the secretion of the foreign protein in 
mammalian cells. Preferably, there are processing sites encoded between the leader 
fragment and the gene of interest such that the leader sequence can be cleaved either in 
vivo or in vitro. 

n 
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Other regulatory regions from viruses can be included in the polynucleotides of 
the instant invention to increase transcription and translation levels or increase the 
duration of transcription and translation. For example, the long terminal repeats of HIV 
can be included. Altematively, the inverted terminal repeats of the Epstein Barr Virus can 
5 be used. 

There exist expression vectors that provide for the transient expression in 
mammalian cells of DNA encoding the target polypeptide. In general, transient 
expression involves the use of an expression vector that is able to replicate 
efficiently in a host cell, such that the host cell accumulates many copies of the 

JO expression vector and, in turn, synthesizes high levels of a desired polypeptide 
encoded by the expression vector. Transient expression systems, comprising a 
suitable expression vector and a host cell, allow for the convenient positive 
identification of polypeptides encoded by cloned DNAs, as well as for the rapid 
screening of such polypeptides for desired biological or physiological properties. 

15 * Thus, transient expression systems are particularly useful for purposes of 
identifying analogs and variants of the target polypeptide that have target 
poly peptide-1 ike activity. 

B. In vivo Vectors 

For delivery using viral vectors, any of a number of viral vectors can be 

20 used, as described in Jolly, 1 994, Cancer Gene Therapv 1: 1-64. For example, the 
coding sequence of a desired polypeptide or ribozymes or antisense molecules 
can be inserted into plasmids designed for transcription and/or translation in 
retroviral vectors, as described in Kimura et al., 1994, Human Gene Therapv 
5:845-852, adenoviral vectors, as described in Connelly et al., 1995. Human Gene 

25 Therapv 6: 185-193, adeno-associated viral vectors, as described in Kaplitt et al., 
1994, Nature Genetics 6: 148-153 and sindbis vectors. Promoters that are 
suitable for use with these vectors include the Moloney retroviral LTR, CMV 
promoter and the mouse albumin promoter. Replication competent free vims can 
be produced and injected directly into the animal or humans or by transduction 

30 of an autologous cell ex vivo, followed by injection in vivo as described in 
Zatloukal et al., 1994, Proc. Natl . Acad, Sci . USA 91:5148-5152. 

9806437A2J_> 



WOj>8/06437 



PCT/US97/I4465 



The polynucleotide encoding a desired polypeptide or ribozyme or 
antisense polynucleotide can also be inserted into plasmid for expression of the 
desired polypeptide in vivo. For in vivo therapy, the coding sequence can be 
delivered by direct injection into tissue, or via oral administration as an aerosol. 
5 Promoters suitable for use in this manner include endogenous and heterologous 
promoters such as CMV. Further, a synthetic T7T7/T7 promoter can be 
constmcted in accordance with Chen et al., 1994, Nucleic Acids Res . 22:21 14- 
2120, where the T7 polymerase is under the regulatory control of its own 
promoter and drives the transcription of polynucleotide sequence, which is also 

10 placed under the control of a T7 promoter. The polynucleotide can be injected in 
a formulation that can stablize the coding sequence and facilitate transduction 
thereof into cells and/or provide targeting, as described in Zhu et al., 1993, • 
Science 261:209-21 L 

Expression of the coding sequence of a desired polypeptide or replication 

15 of a ribozyme or antisense polynucleotide in vivo upon delivery for gene therapy 
purposes by either viraJ or non-viral vectors can be regulated for maximal 
efficacy and safety by use of regulated gene expression promoters as described in 
Gossen et al., 1992, Proc. Natl . Acad. Sci . USA 89.-5547-5551. For example, the 
polynucleotide transcription and/or translation can be regulated by tetracycUne 

20 responsive promoters. These promoters can be regulated in a positive or 
negative fashion by treatment with the regulator molecule. 

For non-viral delivery of the coding sequence of the desired polypeptide, the 
sequence can be inserted into conventional vectors that contain conventional 
control sequences for high level expression. 
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C. Preferred Vector 

A preferred vector comprises: ( 1) an (EBV) origin of replication or a 
BKV (BK virus), a parvovirus, origin of replication; (2) a coding region for an 
EBV or BKV origin binding protein; (3) at least one inverted terminal repeat; (4) 
5 a promoter; (5) an enhancer; (6) a terminator; (7) optionally, a selectable marker. 

Preferably, the orgin of replication is EBV ori p; more preferably, 
nucleotides 2623 to 4559 of SEQ ID NO: 1 are utilized. The sequence is 
obtainable from vector pCEP4, conunercially available from Invitrogen, San 
Diego, California, USA. 
^0 Preferably, the coding region encodes the EBV nuclear antigen A, which 

binds to EBV ori p; more preferably, the polynucleotide sequence is nucleotides 
14 to 2594 of SEQ ID NO: 1 are utilized. The sequence is obtainable from vector 
pCEP4, conmiercially available from Invitrogen, San Diego, California, USA. 
Fragments and mutants of the preferred origin and binding protein 
1 5 capable of initiating replication of the vector in the desired host cell can be 

utilized. Preferably, the fragments and mutants will retain more than about 80% 
sequence identity with nucleotides 14 to 2594 or 2623 to 4559 of SEQ ID NO:l 
or fragment thereof; more typically, more than about 85%; even more typically, 
at least 90%. Preferably, these polynucleotides exhibit more than about 92% 
20 sequence identity with nucleotides 14 to 2594 or 2623 to 4559 of SEQ ID NO: 1 
or fragment thereof; more preferably, more than about 94%; even more 
preferably, more than about 96%; even more preferably, more than about 98%; 
even more preferably, more than about 99%. 

Preferably, the inverted terminal repeats are those sequences found in 
25 adenovirus (AV) or adeno-associated virus (AAV); more preferably, the inverted 
terminal repeats are those found in AAV; even more preferably, the 
polynucleotide sequence is 4938 to 5 1 04 or 7 1 89 to 7355 of SEQ ID NO: 1 . The 
sequence of AAV is described in Samulski et al., 1987, J. Virol . 61:3096-3101. 
Fragments and mutants of the preferred inverted terminal repeat capable 
30 of initiating replication of the vector in the desired host cell can be utilized. 

Preferably, the fragments and mutants will retain more than about 80% sequence 
identity with nucleotides 4938 to 5104 or 7 189 to 7355 of SEQ ID NO: lor 
fragment thereof; more typically, more than about 85%; even more typically, at 

'f 
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least 90%. Preferably, these polynucleotides exhibit more than about 92% 
sequence identity with nucleotides 4938 to 5 104 or 7 1 89 to 7355 of SEQ ID NO: 
lor fragment thereof; more preferably, more than about 94%; even more 
preferably, more than about 96%; even more preferably, more than about 98%; 
5 even more preferably, more than about 99%. 

Preferably, the cytomegalovirus enhancer/promoter is utilized; more 
preferably, the CMV promoter sequence is nucleotide sequence 51 12 to 6734 of 
SEQ ID NO: I. 

Mutants and fragments of the preferred enhancer and promoter capable of 
0 initiating transcription and/or translation can be utilized. Preferably, the 

fragments and mutants will retain more than about 80% sequence identity with 
nucleotides 51 12 to 6734 of SEQ ID NO: lor fragment thereof; more typically, 
more than about 85%; even more typically, at least 90%. Preferably, these 
polynucleotides exhibit more than about 92% sequence identity with nucleotides 
5 51 12 to 6734 of SEQ ID NO: lor fragment thereof; more preferably, more than 
about 94%; even more preferably, more than about 96%; even more preferably, 
more than about 98%; even more preferably, more than about 99%. 

A preferred terminator is the bovine growth hormone poly A sequence; 
more preferably, the polynucleotide sequence is nucleotide 6818 to 7050 of SEQ 
IDNO:l. 

Mutants and fragments of the preferred terminator capable of terminating 
transcription and/or translation can be utilized. Preferably, the fragments and 
mutants will retain more than about 80% sequence identity with nucleotides 
6818 to 7050 of SEQ ID NO: lor fragment thereof; more typically, more than 
about 85%; even more typically, at least 90%. Preferably, these polynucleotides 
exhibit more than about 92% sequence identity with nucleotides 68 1 8 to 7050 of 
SEQ ID NO: lor fragment thereof; more preferably, more than about 94%; even 
more preferably, more than about 96%; even more preferably, more than about 
98%; even more preferably, more than about 99%. 

The sequence of the preferred vector is shown in SEQ ID NO: 1 . 
Polynucleotides encoding polypeptides, such as eiythropoeitin or leptin, and 
ribozymes and antisense polynucleotides can be inserted into the vector. 

IS 
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t>. Examples of Coding Regions, Ribozvmcs> and Antisense Molecules 
The following are examples of coding regions, ribozymes, and antisense 
molecules that can be used to treat various indications in mammals. The 
nucleotide sequence of the genes of interest can be found, for example, in 
5 publically available databases, such as Genbank. Polynucleotides to be delivered 
can be used to treat viral infections or chronic pathogen infection. 

1. Hemophilia 

Gene replacement by in vivo delivery of polynucleotides can be effective 
in treating hemophilia. The following are examples of polypeptides that can be 
10 encoded by the polynucleotides to be delivered: Factor VIILC, mutants of 
Factor VIILC, preferably those that are uncleavable. Also, useful to treat 
hemophilia are ribozyme and antisense polynucleotides as inhibitors of Tissue 
Factor Plasminogen Inhibitor (TFPI), 

The routes of delivery for treating hemophilia include, for example, 
15 intravenous/intrahepatic injection, ex vivo transduction of stem cells or 
lymphocytes using retroviral vectors. 

2. Treatment of Graft Versus Host Disease 

In vivo delivery of polynucleotides encoding prodrugs can be used for 
direct ablation to treat graft versus host disease in, for example, leukemia bone 
20 marrow transplantation. Herpes thymidine kinase in conjunction with 

gancyclovir can be utilized for this purpose. Other examples of prodmgs are 
described in the cancer section. 

The routes of delivery for treating graft versus host disease include, for 
example, ex vivo transduction of T-lymphocy tes using retroviral vectors. 

25 3. Vaccines 

In vivo delivery of polynucleotides encoding a desired antigen can be 
utilized to induce an immune response. This response can include both cellular 
and humoral response. This type of vaccine can be used to treat cancer as well as 
infectious diseases. Further, such treatment can be either prophylactic or 

30 therpeutic immunotherapy. 
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Examples of infectious diseases include. Human Immunodeficiency 
Virus (HIV), Hepatitis A, B, C, etc., (HAV, HBV, HCV, etc.). Human Papiloma 
Virus (HPV), cytomegalovirus (CMV), herpes simplex 1 and 2 (HSV), etc. 
Preferred antigens include non-structural proteins 3, 4a, and 5b (NS3, NS4, and 
5 NS5b) of HCV; gB2 and gD2 of HSV; env and rev proteins of HIV. 

Also, cancer antigens can be used in vaccines, for both therapeutic and 
prophylactic purposes. 

The antigens can be presented in the context of Class I major 
histocompatibility antigens, or to induce a cellular cytotoxic T cell response, or 
10 to induce a humoral response comprising the synthesis of antibodies. 

In addition, an antisense or ribozyme target to a immune suppressive 
molecule, IL-10, TGF-p, and CTLA-4, for example, can be useful to be 
administered with a vaccine. 

The routes of delivery for vaccines include, for example, intramuscular 
15 injection, dendritic cell-based inmiunization, or oral inmiunization by both viral 
and non-viral vectors. 



4. Diabetes Mellitus 

Diabetes is another indication that can be treated by in vivo delivery of a 
replacement gene. The following are examples of useful polypeptides to be 
20 encoded by the replacement gene: insulin, insulin-like growth factor I and II 
(IGF-I and II). 

Also useful for treating diabetes are polynucleotides encoding lAS-L, 
found on the surface of B ceils in the pancreas, to protect the cells from immune 
destruction. 

25 The routes of delivery for treating diabetes include, for example, liver- 

directed, parotid- directed, pancreas-directed, salivary gland-directed using both 
viral and non-viral vectors. 



5. Hvperlipidemia 

Hyperlipidemia can be treated by in vivo delivery of the following 
30 polynucleotides encoding apoproteins or lipoprotein receptors. A more extensive 
description of lipoproteins and apoproteins is provided below. 

/7 
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The routes of delivery for treating hyperlipidemia include, for example, 
liver-directed intravenous administration by both viral and non- viral vectors. 



6. Mvocardial Ischemia or Infarction 

The following are examples of polynucleotides that are useful, when 
5 delivered in vivo, to treat myocardial ischemia or infarction: 

polynucleotides encoding basic fibroblast growth factor (bFGF), 
fibroblast growth factor 5 (FGF-5) and IGF-I. 

The routes of delivery for treating myocardial ischemia or infarction 
include, for example, intrapericardial delivery of viral vector or non-viral 
10 vectors. 



7. Bowel Disease 

The following are examples of polynucleotides that can be delivered in 
vivo to treat bowel disease: 

(i) ribozymes or antisense polynucleotides as inhibitors of 

15 macrophage/inflanunatory cell recruitment or activation, such as NFkB; 

(ii) ribozymes or antisense polynucleotides to act as anti-apoptotic 
agents, such as inhibitors of interleukin lb converting enzyme family; 

(iii) polynucleotides encoding complement blockers, such as decay 
accelarting factor (DAF), membrane cofactor protein (MCP); and the fusions of 

20 DAF and MCP also known as CAB-2; 

(iv) cyclboxygenase inhibitors; 

(v) anti-proliferaiive agents, such as, ribozymes, antisense 
oligonucleotides, antibodies, protein, or peptides against c-myb, ras/raf, PI3 
kinase, cyclins; 

25 (vi) polynucleotides encoding suicide proteins/genes, such as, herpes 

thymidine kinase; 

(vii) polynucleotides encoding replacement genes or proteins which 
maybe deficient or down regulated during the devleopment of inflammatory 
bowel disease. 

30 (viii) polynucleotides encoding IkB. 



18 



BNSDCXiD: <WO__9806437A^I_> 



wo 98/06437 



PCT/US97/i4465 



8. Prostate Cancer and Benign Prostatic Hyperplasia 
The following polynucleotides can be delivered to treat prostate cancer 
and benign prostatic hyperplasia: 

(i) a polynucleotide encoding a pro-apoptotic agent, including for 

5 example, fas, fas hgand, fadd, fap-1, tradd, faf, rip, reaper, apoptin, interleukin-2 
converting enzyme; 

(ii) a polynucleotide encoding an anti- angiogenic agent, including, for 
example, bFGF soluble receptor and fragments, angiostatin, transforming growth 
factor-p (TGF-P), interferon-a (IFNa), proliferin-related protein, a urokinase 

0 plasminogen activator receptor antagonist, platelet factor 4 (PF4), 

thrombospondin, a tissue inhibitor of metailoproteinase, and prolactin; 

(iii) a polynucleotide encoding a inmiunomodulating agent including, 

for example, interleukin-2 (IL-2), IFNa, IFNp, IFNy, granulocyte macrophage- 

colony stimulating factor (GM-CSF), and macrophage-colony stimulating factor 
(M-CSF); 

(iv) a ribozyme or antisense polynucleotide as an antiproliferative agent 
including, for example, an inhibitor of a signal transduction pathway, for 
example, an inhibitor of a signal transduction pathway mediated by myb, ras» ras 
superfamily, raf, phosphoinositol (PI3-kinase), a phosphotyrosine binding (PTB) 
domain, a SRC homology-2 (SH2) domain, a SRC homology-3 (SH3) domain, a 
plexthn homology (PH) domain, JUN kinase, and a stress activated kinase, 
signaling inositol phosphatases; and an inhibitor of a cyclin; 

(v) a ribozyme or antisense polynucleotide as an inhibitor of a growth 
factor or inhibitor of a receptor of a growth factor, including, for example, 
epidermal growth factor (EOF), TGF-a, FGF, TGF-p. platelet derived growth 
factor (PDGF), keratinocyte growth factor (KGF), or any prostate cell specific 
growth factor; 

(vi) a polynucleotide encoding a tumor suppressor gene or a gene down- 
regulated during the onset of a hyperplastic condition in the prostate; and 

(vii) an antisense or ribozyme target to a inunune suppressive molecule, 
IL-iO, TGF-p, and CTLA-4, for example. 

I > 
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9. Anemia. Leukopenia, and Thrombocytopenia 

Anemia can be treated by in vivo delivery of a polynucleotide encoding 
erythropoietin, GM-CSF-, G-CSF, M-CSF, and thrmobopoietin, for example. 
Examples of delivery routes for this indication include without limitation: liver- 

5 targeted intravenous administration of viral vectors and non-viral vectors. See 

\. ■ 

the Examples below. 

10- Cardiomvopathv 

The following are examples of polynucleotides that can be delivered in 
vivo to treat cardiomyopathy: polynucleotides encoding, IGF-1, L-amino acid 
10 decarboxylas, inhibitors of p adrenergic receptor icinases (BARK), troponin, and 
P adrenergic receptors. 

Examples of delivery routes for this indication include, without 
limitation, pericardial expression of IGF-1, and for the other genes, 
intramycardial injection or myocardial trageting via intracoronary injection or 
15 intrapericardial administration of viral vectors or non-viral vectors. 

1 1. Rheumatoid Arthritis 

The following are examples of polynucleotides that can be delivered in 
vivo to treat rheumatoid arthritis, polynucleotides encoding a prodrug, such as 
herpes thymidine kinase, MMP inhibitors, fas, and pro-apoptotic proteins, 
20 described above, and interleukin- 1 receptor A, interleukin- 1 0, IicB. 

Also, antisense and ribozyme polynucleotides as inhibitors of NFkB. 

Examples of delivery routes for this indication include, without 
limitation, intraarticular injection of viral and non-viral vectors. 

12. Osteoarthritis and Psoriasis 

25 The following are examples of polynucleotides that can be delivered in 

vivo to treat osteoarthritis and psoriasis: polynucleotides encoding IGF-1 ; 
ribozyme and antisense polynucleotides as inhibitors of metalloproteinase 
inhibitors. 

Also, the following are examples of polynucleotides that can be delivered 
30 in vivo to treat osteoarthritis and psoriasis, polynucleotides encoding a prodrug, 

2P 
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such as herpes thymidine kinase, MMP inhibitors, fas, and pro-apoptotic 
proteins, described above, and interleulcin- 1 receptor A, interleukin-lO, IicB. 
Also, antisense and ribozyme polynucleotides as inhibitors of NFicB 
Examples of delivery routes for this indication include, without 
5 limitation, intraarticular injection. 



13. Restenosis 

The following are examples of polynucleotides that can be delivered in 
vivo to treat restenosis: 

(i) polynucleotides encoding a prodrug, such as thymidine kinase, 
10 other examples are described in the cancer section; 

(ii) polynucleotides encoding tissue factor plasminogen inhibitor 

(TFPI); 

(iii) polynucleotides encoding c-myb rbz, c-ras rbz, 

(iv) polynucleotides encoding pro-apoptotic agents, described above; 
15 (v) polynucleotides encoding IkB. 

Examples of delivery routes for this indication include, without 
limitation, intracoronary delivery of viral and non-viral vectors. 



14, Cancer 

The gene delivery vectors of the invention are useful in delivering 
20 therapeutic genes for treatment of hyperproliferative disorders, including 
malignancy, for treatment of infectious disease and for treatment of 
inflammatory diseases, including autoinunune disease. For instance, the gene 
therapy vectors can be used to express cytokines or proteins that convert an 
inactive or partially active prodrug into an active drug. In many cases, 
25 conversion of the prodrug into its active form results in a compoimd with 
cytolytic activity. 



a. Prodrug converting enzymes 
A number of "suicide genes" which encode different proteins useful in 
30 prodrug conversion can be used in the instant invention. For instance, nucleoside 
kinases such as thymidine kinase are panicularly useful. In particular, the HSV- 
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TK system has important advantages for anti-tumor cell therapy. See PCT 
publication number WO 91/02805 entitled "Recombinant Retroviruses 
Delivering Vector Constructs to Target Cells" and PCT publication number WO 
95/14014091 entitled "Compositions and Methods for Utilizing Conditionally 
5 Lethal Genes" for a description of treatment of cancer and other diseases by gene 
delivery vectors expressing thymidine kinase and other prodrug converting 
enzymes. HS V-TK transduced tumor cells can mediate a significant bystander 
killing effect on untransduced neighboring cells in vitro and in vivo (Moolten el 
al', supra .. Freeman et al., 1993, Cancer Res. 53:5274), most commonly as a 

10 result of transfer to the toxic ganciclovir metabolite, GCV triphosphate, between 
adjacent cells through intercellular gap junctions (Bi et aj., 1993, Human Gene 
Therap . 4:725). Endothelial cells in capillary walls are. connected by gap 
junctions, so a dramatic bystander effect created by GCV-triphosphate transfer 
between neighboring endothelial cells and the massive amplification effects of 

15 the clotting cascade and the tumor to endothelial cell ratio could ensue 

(Denekamp et al., 1986. Cancer Topics 6:6: Denekamp et al., 1984, Prop . AppL 
Microcir . 4:28). Recent evidence suggests that the occasional transduction of 
tumor endothelial cells during intralesionai therapy with HSV-TK retroviral 
vectors may account for a significant component of the antitumor activity of the 

20 vectors (Ram et al., 1994, J. Neurosurp . 81:256). In addition, the suicide gene is 
only conditionally cytotoxic to the target cells (i.e. only when GCV is given). 
Consequently, an ex vivo administration method can be be utilized. For example, 
in this type of protocol, endothelial cells may be isolated from tumor biopsies 
(Medzelewski et al., 1994, Cancer Res . 54:336), induced to proliferate with 

25 appropriate mitogens (Ferrara et al., supra . 1 and transduced with TK in vitro. 
Transplanted EC become incorporated into the neovasculature in days to weeks 
after intratumoral injection (Lai et al-» 1994, Cancer Gene Therap . 1:322^, so 
GCV treatment would follow a suitable lag phase* to allow the transduced EC to 
integrate functionally in to the tumor vasculature. The two-step enzyme-prodrug 

30 system offers greater flexibility of delicate clinical management, because 
cessation of GCV infusion in the event of (potentially very serious) 
complications arising from damage to normal EC, would block toxicity without 
the need to block transgene activity in situ. 
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A number of alternative 'suicide genes' in addition to thymidine kinase 
may also be useful for cancer gene therapy (Moolten et al., supra .). Introduction 
of the bacterial cylosine deaminase gene (Huber et al., 1993, Cancer Res. 
53:4619) into tumor cells confers sensitivity to the antifungal agent 5- 
5 fluorocytosine (5-FC). Cytosine deaminase converts 5-FC to 5-fluorouracil (5- 
FU, Nishiyama et al., 1985, Cancer Res . 45 : 1753). Since 5-FU is commonly 
used chemotherapeutic drug for breast cancer, several groups have developed 
cytosine deaminase-based 'suicide gene* therapy models for this disease. Tumor 
specificity may be further increased by introducing the c-erbB2 

10 promoter/enhancer elements 5' to the cytosine deaminase gene, so that the 
therapeutic transgene is preferentially transcribed in c-erbB2-overexpressing 
breast tumor cells (Harris et al., 1994, Gene Therap , 1:170). Alkaline 
phosphatase has been widely explored as prodrug-activating enzyme in the 
related field of antibody directed enzymne-prodrug therapy (ADEPT). This 

15 enzyme has the advantage that it can activate a wide range of phosphorylated 

derivatives of anticancer agents {e.g. mitomycin C, etoposide, etc.) that cannot . 
cross cell membranes until the charged phosphate group is cleaved off, so a 
single enzyme could generate de novo a cocktail of chemotherapeutic agents 
within the tumor mass (Senter et aJ., 1993, Bioconiugate Chem . 4:3). Other 

20 suicide genes may encode a polypeptide or polypeptides (with a corresponding 
non-cytotoxic agent) such as Herpes Simpex virus thymidine kinase (gancyclovir 
or acyclovir). Varicella Zoster virus thymidine kinase (6 methoxypurine arabino 
nucleoside; Huber ^ al., 1991, Proc . Natl , Acad . Sci , USA 88:8039), £. coii 
cytosine deaminase (fluorouracil; Mullen et al., 1992, Proc . Natl. Acad. Sci. 

25 USA . 89:33), E. coli xanthine-guanine phophoribosyl transferase (thioxanthine; 
Beshard et al., 1987, Mol. Cell Biol . 7:4139), E. coli or Leishmania purine 
nucleotide phosphorylase (various nontoxic purine deoxyadenosine, adenosine, 
deoxyguanosine, or guanosine derivatives (Koszaika and Krenitsky, 1979, J. Biol 
Chern 254:8185, 1979; Sorscher et al., 1994. Gene Thcrapv 1:233>, cytochrome 

30 pla50 2B 1 or cytochrome p450 reductase (e.g., Samino- 1,2,4 benzotriazine 1 ,4- 
dioxide (Walton gL al- 1992, Biochem . Pharmacol . 44:25 1 ), cell surface alkaline 
phosphatase {e.g., etoposide monophosphate; Senter etal., 1988, Proc . Natl. 
Acad . Sci . USA 85:4842, 1988), nitroreductase {e.g., metronidazole or 
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nitrofurantoin; Hof et al., 1988, Immunitat und Infektion j6:220), N-deoxyribosy 
transferase (l-deazapurine; Betbeder et al., 1989. Nucleic Acids Res 17 :4217). 
pyruvate ferrodoxin oxidoreductase (metronidazol; Upcroft et al., 1990, Int. J. 
Parasitolog. 20:489), carboxypepidase G2 (aminoacylate nitrogen mustards; 
Antoniw et al., 1990, Brit J. Cancer 62:909), carboxypeptidase A (methotrexate 
alpha alanine; Haenseler et al., 1992, Biochemistry 31:891). ♦ lactamase 
(cephalosporin derivatives; Meyer et 1993, Cancer Res , 53:3956; and 
Vradhula et al., 1993, Bioconiugate Chemistry 4:334), Actinomycin D 
synthetase complex (synthetic pentapeptide lactone precursors; Katz et al., 1990, 
J. Antibiotics 43:23 1 ), and •-glucuronidase (various glucuronide derivatives of 
toxic drugs such as doxorubicin; Bosslet et ai., 1994, Cancer Res . 54:2151; 
Haeberlin et al., 1993, Pharmaceutical Res . JO: 1553). 

Any of a variety of other enzymes which convert inactive prodrugs into 
active drugs and known to those of skill in the art can also used in the gene 
delivery vehicles of the invention. For example, see PCX publication number 
WO 95/14014091 entitled "Compositions and Methods for Utilizing 
Conditionally Lethal Genes", and EurofDean Patent publication number 
EP90309430, entitled "Molecular Chimeras Useful for Cancer Therapy- 
Comprising Regulatory Sequences and heterologous enzyme, e.g. Varicella 
Zoster Virus Thymidine Kinase" for a description of additional prodrug/enzyme 
systems useful for gene therapy. As an additional example, see PCT Patent 
Publication No. WO 95/13095 entitled "New Prodrugs and Enzyme Targeting 
Molecule Conjugates—Useful in Antibody Direct Enzyme Prodrug Therapy of 
e.g. Viral Infections". 

A variety of tumors may be targeted for treatment by the gene delivery 
vehicles of the invention. In general, solid tumors are preferred, although 
leukemias and lymphomas may also be treated if they have developed a solid 
mass, or if suitable tumor associated markers exist such that the tumor cells can 
be physically separated from nonpathogenic normal cells. Representative 
examples of suitable tumors include melanomas, colorectal carcinomas, lung 
carcinomas (including large cell, small cell, squamous and adeno-carcinomas). 
renal cell carcinomas and breast adeno-carcinomas. Gene delivery vehicles 
expressing thymidine kinase and other prodrug converting enzymes are also 
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useful in the treatment of autoimmune diseases including rheumatoid arthritis, 
osteoarthritis and graft vs. host disease. See e.g. PCT Patent Publication No. 
WO 95/14091, entitled "Compositions and Methods for Utilizing Conditionally 
Lethal Genes," for a description of treatment of disease with gene therapy vectors 
5 expressing prodrug converting enzymes. 

b. Cytokines 

A variety of polynucleotides encoding cytokines and immune system 
modulators can be delivered by the gene delivery vehicles of the invention for 

ID treatment of a number of different disorders. Representative examples include 
cytokines, such as IL-1, IL-2 (Karupiah et al., 1990, J. Immunology 144 :290- 
298; Weber etal., 1987, J. Exfi. Med. 166:1716-1733; Gansbacheretal^^ 1990, J. 
ExD . Med. 172:1217-1224; U.S. Patent No. 4,738,927), lL-3, IL-4 (Tepper et al., 
1989. Cell 52:503-512; Golumbeketal., 1991, Science 254:713-716, 1991; U.S. 

15 Patent No. 5,017,691), IL-5, IL-6 (Brakenhof et al., 1987, J. Inmiunol . 139:41 16- 
4121; WO 90/06370), IL-7 (U.S. Patent No. 4,965,195) , IL-8, IL-9, IL-10, DL- 
1 1, IL-12, IL-13 f Cvtokine Bulletin . Summer 1994), IL-14 and IL-15, 
particularly IL-2, IL-4, IL-6, IL-12, and IL-13, alpha interferon (Finter et al., 
1991, Dru£s 42(5):749-765; U.S. Patent No. 4,892,743; U.S. Patent No. 

20 4,966,843; WO 85/02862; Nagata et al., 1980, Nature 284 :3 1 6-320; Familletii 
etal., 198 L Methods in Enz. 78:387-394: Twu et al.. 1989, Proc. Natl . Acad . 
Sci. USA 86:2046-2050; Faktor et al., 1990, Oncogene 5:867-872), beta 
interferon (Self et al., 1991, J. ViroL ^:664-671), ganmia interferons (Radford 
et al.. The American Society of Hepatology 2008-2015, 1991 ; Watanabe et al., 

25 PNAS 86:9456-9460, 1989; Gansbacher et aJ., 1990, Cancer Research 50:7820- 
7825; Maioet al., 1989. Can . Immunol. Immunother . 30:34-42; U.S. Patent No. 
4,762,791 ; U.S. Patent No. 4,727,138), G-CSF (U.S. Patent Nos. 4,999,291 and 
4,810,643), GM-CSF (WO 85/04188), tumor necrosis factors (TNFs) (Jayaraman 
et al., 1990, J. Immunology 1 44 :942-95 n, CD3 (Krissanen et al., 1987, 

30 Immunogenetics 26:258-266, 1987), ICAM-1 (Altman et al., 1989, Nature 
338:512-514: Simmons etal., 1988. Nature 331 :624-627^ ICAM-2, LFA-1, 
LFA-3 (Wallner et al., 1987, J. Exb. Msd. 166(4): 923-932), MHC class I 
molecules, MHC class II molecules, B7.1-.3. 2'nucroglobulin (Fames etal., 
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1981, Proc . Natl . Acad . Sci. 28:2253-2257), chaperones such as calnexin, MHC 
linked transporter proteins or analogs thereof (Powis et aJ., 1991 , Nature 
354:528-531, 1991). 

Genes encoding any of the cytokine and inununomodulatory proteins 
5 described herein can be expressed in a gene delivery vehicle of the invention. 
Other forms of these cytokines which are known to those of skill in the art can 
also be used. For instance, nucleic acid sequences encoding native IL-2 and 
gamma- interferon can be obtained as described in U.S. Patent Nos. 4,738,927 
and 5,326,859, respectively, while useful muteins of these proteins can be 

10 obtained as described in U.S. Patent No.4,853,332, As an additional example, 
nucleic acid sequences encoding the short and long forms of mCSF can be 
obtained as described in US Patent Nos. 4, 847,201 and 4,879,227, respectively. 

Other nucleic acid molecules that encode cytokines, as well as other 
nucleic acid molecules that are advantageous for use within the present 

15 invention, may be readily obtained from a variety of sources, including, for 
example, depositories such as the American Type Culture Collection (ATCC, 
Rockville, Maryland), or from conmiercial sources such as British Bio- 
Technology Limited (Cowley, Oxford England). Representative examples 
include BBG 1 2 (containing the GM-CSF gene coding for the mature protein of 

20 127 amino acids), BBG 6 (which contains sequences encoding gamma 

interferon), ATCC No. 39656 (which contains sequences encoding TNF), ATCC 
No. 20663 (which contains sequences encoding alpha interferon), ATCC Nos. 
31902, 31902 and 39517 (which contains sequences encoding beta interferon), 
ATCC No 67024 (which contains a sequence which encodes Interleukin- 1 b), 

25 ATCC Nos. 39405, 39452, 39516, 39626 and 39673 (which contains sequences 
encoding Interleukin-2), ATCC Nos. 59399, 59398, and 67326 (which contain 
sequences encoding Interleukin*3), ATCC No. 57592 (which contains sequences 
encoding Interleukin-4), ATCC Nos. 59394 and 59395 (which contain sequences 
encoding Interleukin-5), and ATCC No. 67153 (which contains sequences 

30 encoding Interleukin-6). 

Gene delivery vehicles expressing the above cytokines are useful in the 
treatment of a variety of disorders. For example, see PCT publication number 
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US94/02951 entitled "Compositions and Methods for Cancer Immunotherapy " 
for a description of gene therapy treatment of malignancy. 



15. Neurological Disorders and Diseases 

Polynucleotides encoding tyrosine hydroxylase can be useful in treating 
Parkinson disease. 

For stroke or any acute brain injuries, polynucleotides encoding IGF-1, 
bFGF, vascular endothelial growth factor (VEGF) are useful. 



16. Pulmonetrv Disorders 

For treating emphysema, polynucleotides encoding a 1 -anti-try psin are 

to useful. 

For treating lung fibrosis, polynucleotides encoding superoxide 
dismutase (SOD) are useful. 

For treating cystic fibrosis, polynucleotides encoding CFTR are useful. 



15 ADDITIONAL AGENTS 

Additional agents can be included with the desired polynucleotides to be 
delivered. These additional agents can facilitate endocytosis of the desired 
nucleic acids or aid binding of the nucleic acids to the cell surface or both, for 
example. 



20 A. Polypeptides 

One example are polypeptides which include, without limitation: 
asialoorosomucoid (ASOR); transferrin; asialoglycoproteins; antibodies; 
antibody fragments; ferritin; interleukins; interferons, granulocyte, macrophage 
colony stimulating factor (GM-CSF), granulocyte colony stimulating factor (G- 

25 CSF), macrophage colony stimulating factor (M-CSF), stem cell factor and 
erythropoietin. Viral antigens, such as envelope proteins, can also be used. 
Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RU. 



27 
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B. Hormones, Vitamins. Etc. 

Other groups that can be included are, for example: hormones, steroids, 
androgens, estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polvalkvlenes. Polysaccharides, Etc. 

5 Polyalkylene glycols can be included with the desired polynucleotides. 

In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-, or polysaccarides can be included. In a preferred 
embodiment of this aspect, the polysaccharide is dextran or DEAE-dextran. 
Also, chitosan and poly(lactide-co-glycolide) 

10 D. Lipids and Liposomes 

The desired polynucleotide can also be encapsulated in lipids or packaged 

in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are 

able to stably bind or entrap and retain nucleic acid. The ratio of condensed 
15 polynucleotide to lipid preparation can vary but will generally be around 1 : 1 (mg 

DN Aimicromoles lipid), or more of lipid. For a review of the use of liposomes 

as carriers for delivery of nucleic acids, see. Hug and Sleight, 1991, Biochim . 

Biophvs . Acta , 1097 :1-1 7; Straubinger et aL, in METHODS OF 

ENZYMOLOGY (1983), Vol. 101, pp. 512-527. 
20 Liposomal preparations for use in the instant invention include cationic 

(positively charged), anionic (negatively charged) and neutral preparations. 

Cationic liposomes have been shown to mediate intracellular delivery of plasmid 

DNA (Feigner eial„ 1987, Proc. Natl . Acad , Sci, USA 84:7413-7416^! mRNA 

(Malone et al„ 1989, Proc , Natl , Acad , Sci . USA 86:6077-608 1); and purified 
25 transcription factors (Debs et al. , 1 990, J. Biol . Chem . 265 : 1 0 1 89- 1 0 1 92), in 

functional form. 

Cationic liposomes are readily available. For example, N[ 1-2,3- 
dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the product line Lipofectin"^, from GIBCO BRL, Grand Island, NY. (See, 
30 also. Feigner eiai., 1987, Proc . Natl , Acad. Sci. USA 84:74 13-74 16V Other 
commercially available liposomes include transfectace (DDAB/DOPE) and 

.9806437A2J_> 



wo 98/06437 



PCTAJS97/14465 



DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from 
readily available materials using techniques well known in the art. See, e.g., 
Szokaet ai., 1978. Proc. Natl . Acad . Sci. USA 75:4194-4198; PCT Publication 
No, WO 90/1 1092 for a description of the synthesis of DOTAP (1,2- 
5 bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as 
from Avanti Polar Lipids (Birmingham, AL), or can be easily prepared using 
readily available materials. Such materials include phosphatidyl choline, 
cholesterol, phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC), 

10 dioleoylphosphatidyl glycerol (DOPG), dioleoylphoshatidyl ethanolamine 
(DOPE), among others. These materials can also be mixed with the DOTMA 
and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small 

15 unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various 
liposome-nucleic acid complexes are prepared using methods known in the art. 
See, e.g., Straubinger et al., in METHODS OF IMMUNOLOGY (1983), Vol. . 
101, pp. 512-527; Szoka et al-, 1978, Proc. Natl . Acad . Sci. USA 75:4194-4198; 
Papahadjopoulos et al., 1975, Biochim . Biophvs . Acta 394 :483: Wilson et al., 

20 1 979, Cell 17:77; Deamer and Bangham, 1976, Biochim . Bioohvs . Acta 

443:629; Ostro et al., 1977, Biochem . Biophvs . Res . Conmiun . 76:836: Fraley et 
al., 1979, Proc . Natl . Acad. Sci . USA 76:3348): Enoch and Strittmatter, 1979, 
Proc , Natl . Acad . Sci . USA 76: 145); Fraley et al., 1980, J. Biol . Chem. 
255: 1043 1; Szoka and Papahadjopoulos, 1978, Proc . Natl. Acad. Sci. USA 

25 75: 145; and Schacfer-Ridder et al-, 1982, Science 215: 166. 

E. Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, 
IDL, LDL, and VLDL. Mutants, fragments, or fusions of these proteins can also 
30 be used. Also, modifications of naturally occurring lipoproteins can be used, 
such as acetylated LDL. These lipoproteins can target the delivery of 
polynucleotides to cells expressing lipoprotein receptors. Preferably, if 
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lipoproteins are including with the polynucleotide to be delivered, no other 
targeting ligand is included in the composition. 

If lipoproteins are included with the desired polynucleotides to be 
delivered, preferably, the composition comprises: (1) lipoprotein; (2) 
5 polynucleotide; and (3) a polynucleotide binding molecule. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. 
The protein portion are known as apoproteins. At the present, apoproteins A, B, 
C, D, and E have been isolated and identified. At least two of these contain 
.several proteins, designated by Roman numerals, AI, All, AIV; CI, CII, CIII. 
10 A lipoprotein can comprise niore than one apoprotein. For example, 

naturally occurring chylomicrons comprise A, B, C, and E, over time these 
lipoproteins lose A and acquire C and E apoproteins. VLDL comprises A, B, C, 

I 

and E apoproteins, LDL comprises apoprotein B; and HDL comprises 
apoproteins A. C, and E. 
15 The amino acids of these apoproteins are known and are described in, for 

example, Breslow, 1985» Annu Rev. Biochem 54:699; Law et al., 1 986, Adv. 
ExE Med. Biol. 151 : 1 62; Chen et al., 1 986, J Bioi Chem 261 : 1 29 1 8; Kane el al., 
1980, Proc Natl Acad Sci USA 77:2465; and Ulermann et al., 1984, Hum Genet 
65:232. 

20 Lipoproteins contain a variety of lipids including, triglycerides, 

cholesterol (free and esters), and phopholipids. The composition of the lipids 
varies in naturally occurring lipoproteins. For example, chylomicrons comprise 
mainly triglycerides. A more detailed description of the lipid content of 
naturally occurring lipoproteins can be found, for example, in Meth. Enzvm . 128 

25 ( 1 986). The composition of the lipids are chosen to aid in conformation of the 
apoprotein for receptor binding activity. The composition of lipids can also be 
chosen to facilitate hydrophobic interaction and association with the 
polynucleotide binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by 

30 ultracentrifugation, for instance. Such methods are described in Meth. Enzv ., 
supra : Pitas et al., 1980, J. Biochem . 255:5454-5460: and Mahey et al., 1979, J. 
Oin. Invest 64:743-750. 

30 
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Lipoproteins can also be produced by in vitro or recombinant methods by 
expression of the apoprotein genes in a desired host cell. See, for example, 
Atkinson et al., 1986, Annu Rev Biophvs Chem 15:403, and Radding et al., 
1958, Biochim . Biophvs Acta 30:443. 
5 Lipoproteins can also be purchased from commercial suppliers, such as 

Biomedical Techniologies, Inc., Stoughton, Massachusetts, USA. 

Mutants, fragments and fusion of the naturally occurring apoproteins are 
useful for delivery of polynucleotides. These polypeptides will retain more than 
about 80% amino acid identity; more typically, more than about 85%; even more 

10 typically, at least 90%. Preferably, these polypeptides will exhibit more than 
about 92% amino acid sequence identity with naturally occurring lipoproteins or 
fragment thereof; more preferably, more than about 94%; even more preferably, 
more than about 96%; even more preferably, more than about 98%; even more 
preferably, more than about 99% sequence identity. 

15 Such mutants, fragments and fusions can be constructed by altering the 

polynucleotides encoding the desired lipoproteins by recombinant DNA 
techniques. See, for example, Sambrook et al., (1989) Molecular Cloning, A 
Laboratory Manual, 2d edition (Cold Spring Harbor Press, Cold Spring Harbor, 
New York). These polynucleotides can be inserted into expression vectors and 

20 host cells can be utilized to produce the desired apoprotein. 

In addition, naturally occurring lipoproteins, mutants, fragments, and 
fusions can be chennucally altered. For example, acetylated LDL has biological 
activity. See, for example, Nagelkerke et al., 1983, J. Biol . Chem . 
258(20):12221-12227; Weisgraber et al„ 1978. J. Biol . Chem . 253:9053-9062: 

25 Voyta et al., 1984, 1 Cell Biol . 99:2034-2040; Goldstein et al., 1979. Proc . Natl . 
Acad. Sci. USA 76:333-337; and Pitas, 198 1 , Arterosclerosis 1: 177-1 85. 

Chemically modified lipoproteins can also be purchased from 
conmiercial suppliers, such as Biomedical Techniologies, Inc., Stoughton, 
Massachusetts, USA. 

30 All of these polypeptides exhibit receptor binding properties of naturally 

occurring lipoproteins. Usually, such polypeptides exhibit at least about 20% 
receptor binding of naturally occurring lipoproteins. More typically, the 
polypeptides exhibit at least about 40%, even more typically the polypeptides 

3/ 
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exhibit at least about 60%; even more typically, at least about 70%; even more 
typically, at least about 80%; even more typically, at least about 85%; even more 
typically, at least about 90%; even more typically, at least about 95% receptor 
binding of the naturally occurring lipoproteins. 

5 Typically, lipoproteins are present in an amount effective to increase the 

frequency of incorporation of polynucleotides into a cell. Such an amount is 
sufficient to increase the frequency of incorporation of polynucleotides into a cell 
by at least 10%, compared to the frequency of incoporation of naked 
polynucleotides; more usually, at least 15%; even more usually, 20%; even more 

0 usually, at least 30%. The increase can be between 40 to 100%, and even 1000% 
and 10000% increase. 

"Polynucleotide binding molecule" refers to those compounds that 
associate with polynucleotides, and the association is not sequence specific. For 
example, such molecules can ( 1 ) aid in neutralizing the electrical charge of 

5 polynucleotide, or (2) facilitate condensation of nucleotides, or (3) inhibit serum 
or nuclease degradation. Optionally, polynucleotide binding molecules can 
interact with lipoproteins by either hydrophobic association or by charge. 
Polynucleotide binding molecules include, without limitation, polypeptides, 
mineral compounds, vitamins, etc. 

0 Examples of polynucleotide binding molecules include: polylysine, 

polyarginine, polyomithine, and protamine. Examples of organic polycations 
include: spermine, spermidine, and purtrescine. Other examples include 
histones, protamines, human serum albumin, DNA binding proteins, non-histone 
chromosomal proteins, coat proteins from DNA viruses, such as (|>X174, 

5 transcriptional factors also contain domains that bind DNA and therefore may be 
useful as nucleic aid condensing agents. Briefly, transcriptional factors such as 
C/CEBP, cvwn, C./05, AP-l, AP-2, AP-3, CPF, Prot-1, Sp-1, Ocl-1, Oct-2, 
CREP, and TFIID contain basic domains that bind DNA sequences. 

Examples of other positively charged moieties include polybrene, 

0 DEAE-dextran, and cationic lipids. Useful cationic lipids and liposomes are 
described above. Lipids and liposomes are not used in this aspect of the 
invention to encapsulate both polynucleotide and lipoprotein. The lipoprotein 
must be exposed to bind the its cell surface receptor. 

3^ 
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Other synthetic compounds that are capable of binding negatively 
charged polynucleotides are useful, such as polymers of N-substituted glycines 
and others, as described below. 

In a composition with a lipoprotein, the polynucleotide binding molecule 
5 can be present in an amount effective to neutralize the polynucleotide. However, 
the polynucleotide binding molecule also can be in excess of an effective amount 
to neutralize the polynucleotide to be delivered. Such an excess can produce a 
net positive electrical charge when complexed with the polynucleotides to be 
delivered. The positively charged complex can then interact with lipoproteins 
ID that comprise negatively charged lipids, such as phospholipids. 

Typically, the polynucleotide binding molecule is in excess when the 
amount is 10% greater than the amount to neutralize the polynucleotide charge; 
more typically, the amount is 50% greater; even more typically, 100% greater; 
even more typically, 150% greater; even more typically, 200% greater; even 
15 more typically, 500% greater; even more typically, 20,000% greater; even more , 
typically, 22,000% greater; even more typically, 25,000% greater; even more 
typically, 30,000% greater; even more typically, more than 40,000% greater than 
the amount effective to neutralize the electrical charge of the desired 
polynucleotide. 

20 POLYCATIONIC AGENTS 

Polycationic agents can be included, with or without lipoprotein, in a 
composition with the desired polynucleotide to be delivered. 

Functional properties 
A. Net Positive Charge 
25 Polycationic agents typically exhibit a net positive charge at 

physiological relevant pH and are capable of neutralizing the electrical charge of 
nucleic acids to facilitate delivery to a desired location. These agents have both 
in vitro^ ex vivo^ and in vivo applications. For example, these polycationic agents 
can be used to transfect cells used to produce recombinant proteins. 
30 Alternatively, the instant polycationic agents can be used to deliver nucleic acids 
to a living subject either intramuscularly, subcutaneously, etc. 

33 
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Physiological relevant pH varies somewhat between in vitro and in vivo 
applications. Typically, physiological pH is at least 5.5; more typically, at least 
6.0; even more typically, at least 6.5. Usually, physiologically relevant pH is no 
more than 8.5; more usually, no more than 8.0; even more usually, no more than 
5 7.5. 

Preferably, the isoelectric point of the instant polycationic agents to 
neutralize nucleic acids is at least 9. 

B. Non-Toxicitv and Non-Immunogenic Properties 

The composition of the polycationic agents of the invention will exhibit 
10 the toxicity and inmiunogenic properties desired. In vitro cell culture will have 
different immunogenic constraints than in vivo mammalian applications. 

The instant polycationic agents can be easily tested for toxicity. For 
example, the agents can be added to medium for cells used in the in vitro assays, 
such as cos-7, Chinese Hamster Ovary cells, etc. Alternatively, the agents can be 
15 tested in standard animal tests for safety. 

C. Condensation Properties 

Due to the electric charge, a subset of these polycationic agents are 
capable of condensing the desired nucleic acids to a compact size to facilitate 
delivery. Typically, condensation **collapses" polynucleotides or nucleic acids 
20 into macromolecular structures, commonly into a toroid form. The smaller size 
of condensed nucleic acids eases delivery by facilitating, for example, packaging 
nucleic acids into liposomes and/or reducing exposure to proteases and/or 
nucleases. 

The condensed nucleic acids exhibit different properties compared to 
25 "relaxed** nucleic acids, such as ( 1) a decrease in intercalation of ethidium 
bromide or other intercalating dye or (2) a reduced mobility in gel 
electrophoresis. Thus, condensation can be measured by at least two different 
assays, an intercalating dye assay or a band shift assay. 

One type of intercalating dye assay uses ethidium bromide. In this assay, 
30 test nucleic acids, conveniently plasmid DN A, are mixed with polycationic agent 
in a ratio from about 1:1 to a 1 :50 weight/weight ration of plasmid to condensing 
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agent. Following incubation, ethidium bromide is added to the reaction to a final 
concentration of 1 |ig/niL. If a nucleic acid such as RNA is used as the test 
nucleic acid, acridine orange may be used as the intercalating dye. The reaction 
mixtures are transferred into UV transparent plastic tubes spotted with 1 % 
5 agarose gel, or placed upon UV transparent plastic c and illuminated with 260 
nm light. The emission from the DNA-ethidium bromide complex is recorded 
on film by a camera equipped with an appropriate UV filter. The ability of an 
agent to condense DNA is inversely proportional to the intensity of the 
fluorescence in each reaction mixture. 

• 10 The more precise test is a band shift assay. Briefly, this assay is 

performed by incubating nucleic acids, either labeled or unlabeled, with various 
concentrations of candidate condensing agents. Test nucleic acids, conveniently 
plasmid DNA, and condensing agent are mixed at 1: 1 to 1 :50 w/w ratios. 
Following incubation, each sample is loaded on a 1% agarose gel and 

15 electrophoresed. the gel is then either stained with ethidium bromide or dried 
and autoradiographed. DNA condensation is determined by the inability to enter 
the gel compared to a non-condensed standard. Sufficient condensation is 
achieved when at least 90% of the DNA fails to enter the gel to any significant 
degree. 

20 Condensation can also be measure by directly determining the size of the 

complex using a light scattering instrument such as the a Coulter N4MD 
subtnicron analyzer, for example. Polynucleotides and a condensing agent are 
incubated at an appropriate ratio, either alone or in the present of 2% PEG-2000 
(Fisher Scientific), and 0.6 M NaCl., and then diluted into 3 ml of water. This 

25 dilute solution is analyzed by the Coulter counter which will detect particles with 
a mean size of 0-1,000 nanometers (nm). Condensing agents, such as poly-L- 
lysine, typically yield particles with a mean diameter of approximately 50-200 
nm. See Lee et al., 1 996, J. mol. Chem . 27 1 : 848 1-8487. 



D. Serum and/or Nuclease Protection Properties 
30 The instant polycationic agents are capable of protecting nucleic acids 

from degradation in serum or from nucleases, including nucleases present in 
biological fluids, such as serum, prostate, synovial fluid, etc. One advantage of 
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this type of protection is that smaller amounts of the desired nucleic acids are 

needed for efficient administration. 

When present in effective amounts, these polycationic agents can inhibit 

serum degradation by at least 5 minutes as compared with uncomplexed nucleic 
5 acids; more usually, the amount used is sufflcient to inhibit degradation by at 

least 10 minutes; even more usually; the amount used is sufficient to inhibit 

degradation by at least 30 minutes; even more usually, the amount used is 

sufficient to inhibit degradation by at least 45 minutes; even more usually, the 

amount used is sufficient to inhibit degradation by at least 60 minutes; even more 
10 usually, the amount used is sufficient to inhibit degradation by at least 90 

minutes; and more usually, the amount used is sufficient to inhibit degradation 

by at least 120 minutes. 

Increased serum protection can be measured simply by incubation of the 

polycation/polynucleotide complex with mouse serum, for example. Preferably, 
15 the serum will not be heat inactivated. After incubation, the mixture can be 

analyzed by gel electrophoresis to determine the quantity of the polynucleotides 

remaining after incubation. 

Alternatively, nucleases can be added to the polycationic agent/nucleic 

acid complexes. The resulting mixture can be analyzed by gel electrophoresis to 
20 determine the amount of degradation. Other biological fluids, such as prostate 

flud, can also be tested. 



E. Mediating Entry of Polvnucleotides into a Cell 
The polycationic agents can mediate entry of polynucleotides into a cell. 
Incorporation of polynucleotides into a cell can be measured by either protein 
25 expression assays or polynucleotide hybridization techniques, for example. 

One method of detecting frequency of incorporation is to include a gene 
that encodes a marker protein, such as luciferase. Cells that have incorporated 
the delivered polynucleotides will express the marker protein. The protein can 
be detected by standard immunoassays, or by biological or enzymatic activity, as 
30 in the case of luciferase. 
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Alternatively, standard hybridization techniques, such as Southern or 
Northern blots or polymerase chain reaction (PGR) techniques, can be used to 
detect the presence of the desired polynucleotides. 

F. Additional Properties 

To facilitate entry of nucleic acids to the interior of cells, the instant 
agents can be capable of 

(a) binding the polynucleotide to the cell surface; 

(b) cell membrane destabilization; 

(c) triggering endocytosis; 

(d) endosome buffering capacity; 

(e) releasing DNA/lipid complexes from endosomes; or 

(f) nuclear tropism. 

Assays for detecting these characteristics are standard and known to those skilled 
in the art. 

15 Physical Properties 

The following physical characteristics are factors to consider when 
determining the composition of the polycationic agents: 

(a) distance between the substituents and the backbone 

(b) the total length of the chain; 

20 (b) hydrophobicity and/or aromacity; 

(c) number of hydrogen bonding groups; and 
(c) charge, including 

(i) type of charge group, (ii) density of charge and (iii) position. 
Other relevant characteristics include structural flexibility. For example, a 
25 helical conformation of the polycationic agent may be preferred for some 
applications. 

Specific dimensions to be considered include 

(a) the distance of phosphate groups in the polynucleotide of interest; and 

(b) the distance of monomer groups in the agents of interest. 

37 

BNSDCXJID: <WO 9806437A2J_> 



5 



]0 



WO98/06437 PCT/US97/14465 

Polypeptide Polvcationic Agents 

The following are examples of useful polypeptides as polycationic 
agents: polylysine, polyarginine, polyomithine, and protaniine. Other examples 
include histones, protamines, human serum albumin* DNA binding proteins, 
5 non-histone chromosomal proteins, coat proteins from DNA viruses, such as 

4)X174, transcriptional factors also contain domains that bind DNA and therefore 
may be useful as nucleic aid condensing agents. Briefly, transcriptional factors 
such as C/CEBP, c-jun. c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, Oct-1, Oct- 
2, CREP, and TFIID contain basic domains that bind DNA sequences. 
10 Organic polycationic agents include: spermine, spermidine, and 

purtrescine. 

The dimensions and of the physical properties of a polycationic agent can 
be extrapolated from the list above, to construct other polypeptide polycationic 
agents or to produce synthetic polycationic agents. 



15 Synthetic Polvcationic Agents 

Synthetic polycationic agents which are useful include, for example, 
DEAE-dextran, polybrene. Lipofectin®, and lipofectAMINE™ are monomers 
that form polycationic complexes when combined with polynucleotides. 

A preferred group of polycationic agents of the present invention have the 
20 following general formula (I): 



Ri R2 O 
I f II 

Ta I — N-C— C I — Tc 



R3 



n 



A preferred subset of these compounds include compounds having formula (I) 
where R, is hydrogen. Even more preferred are polymers comprising at least one 
25 natural amino acid. Also preferred are polymers where R, and R3 are both 
hydrogen, also referred to as poly N-substituted glycines or poly NSGs. 
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A. Monomers 

The polycationic agent of the invention comprises monomers with the 
following structure (11): 

Ri Ra O 
i T II 
— N— C— C 

m 

Generally, R,, R,, and R, are moieties each with a molecular weight from 

1 to 250 daltons. More typically, the molecular weight is no more than 200; 

even more typically, no more than 175. 
5 Typically, each monomer comprises one hydrogen at R,, R,, or More, 

typically, either R^ and R3are both hydrogen, the stmcture of an L-amino acid; or 

R-j and R3 are both hydrogen, the structure of a NSG. 

Monomers to be utilized in the polycationic agents can be either 

positively or negatively charged. Also, neutral substituents can also be utilized. 
10 Degradation sites can be incorporated into the polymer, for example, by 

including substituents from a natural amino acid when R, and R^ are hydrogen. 

These monomers can be positively or negatively charged, or neutral. 

As a general rule, a basically charged monomer has a pKa value for the 

side chain of at least 7.5. Positively, or basically, charged monomers include 
15 without limitation those containing the following functional groups: amino, 

guanidino, hydrazido, and amidino. These functional groups can be either 

aromatic or aliphatic. 

Positively charged monomers comprising hydrogen at R3 and R^, can be 

included in the polycationic agent, for example, as a degradation site. Such 
20 degradation site may aid in separation of the polycationic agent from the 

polynucleotide to permit further processing. For an L-amino acid like monomer, 

useful Rj substitutents are, for example, from those found in naturally occurring 

amino acids, such as lysine and arginine. Also, sidechains from amino acid 

analogues can be used such as ornithine and canaline; or modifications of basic 
25 amino acids, such as homoarginine, and modifications of other amino acids such 

as guanidinovalinate, and aminoethylcysteine. The substitutents found in L- 
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amino acids can also be incorporated at the R, and R3 positions of the instant 
polycationic agents. 

Naturally occurring amino acids and analogues are designated D-amino 
acids to indicate the chirality of these molecules. L-amino acids can also 
5 incorporated as monomers into the polycationic agents. The subsiituents of L- 
amino acids can be, for example, the same as those named for the D-amino acids. 

Preferable monomers include N-substituted glycine monomers. 
Exemplary N--substitutions include alklphenyl, indolylalkyl, alkoxyphenyl, 
halophenylaikyl, hydroxyphenylalkyl, as well as the N-substitutions shown 
below. 



^0 
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I 



N— C— (CH->)n' 
II 

* R2 



N— R4 
I 

Rs 



N— C— (CH2)n 



— NHi* 



Alky I ammoni um, 
where preferably R i = H; 

R2 = CH3; 

R3, R4, R5 can each be CH3. or CH3CH2; and 

n = 1-6. 

N — C— (CH2)n— NH-..^ NHo 
I ' ^ " 

NH2 

GuanidinoalkyU 

where preferably R 1 = H; 

R2 = H. CH3; 

R3 = CH3, CH3CH2; and 

n= 1-6. 



Aminoaikyl, 

where preferably R 1 = H; 

R2 = H, CH3; 

R3 = CH3, CH3CH2; and 

n= 1-6. 



N— C— (CH2)n 
^ R2 




NH2^ 
NH2 



Amidinoalkyl, 

where preferably R 1 = H; 

R2 = H, CH3; 

R3 = CH3, CH3CH2; and 

n= 1-6. 




Aminocyc lohexy I 




Guanidinobenzy] 

where preferably R ] s= H; and 

R2 = H. CH3 
R3, R4, and R5 each can be H or 



NH 




N— H 



Piperidyl 





Amidinobenzyl 

where preferably R ] = H; 

R2 = H, CH3; and 
NH2 R3^ R4^ R5 each can be - 



NH2 



NH2 
'NH2* 



Pyridylmethyl 
where preferably 
R|=H; 

R2 = H, CH3; and 

R4 = H, CH3O, CI. F, Br, CH, N02, CH3. 



R2 



Aminobenzyl 
where preferably 
Rl =H; 

R2 = H, CH3; and 

R4 = H. CH3O, CI F, Br, CH. NO2, CH3. 



The positively charged substituents described above can also be placed ai 
the RjOr positions of formulas (I) and (II), 
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The polycationic agents can comprise negatively charged or neutral 
monomers. As with the positively charged monomers, D-amino acid, L-amino 
acid, and NSGs are preferred to be incorporated as monomers. 

The following are examples of such monomers: 




tryptamine 



( 



amphetamine 




e(hytt>enzenesuUonamid6 



ft 



NHt 



4Z 
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^ MthoxypheneUiyls 



chlorophenechyls 




^3 
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Polvcationic Polymers 



to 



15 



20 



25 



Typically, the polycationic agents exhibit a predicted isoelectric point of at least 
9, excluding the terminal groups. Further, the agents contain, excluding the terminal 
groups, at least 20% positively charged monomers; more typically, at least 25%; more 
typically, 30%; and preferably, at least 33% positively charged monomers. Typically, 
the agents do not comprises greater than 5% acidic monomers and preferably none. 

The charge density and composition of the polycationic agent can be altered to 
accommodate the specific nucleic acid sequence, type, and other components included 
with the complex of nucleic acids and polycationic agent. 

Usually, the length of the polymer is at least 8 monomers; even more usually, 12 
monomers; even more usually, 18 monomers. More typically, the polycationic agents of 
the invention will be at least 24 monomer units in length; more typically, 30 monomer 
units; even more typically, 36 monomer units; even more typically, 48 monomer units. 
The polycationic agent can be up to 50 to 75 to 1 00 monomer imits in length. 

Preferably, the polycationic agent comprises monomers where all and R3 are 
hydrogen. Even more preferably, where all R2 and R3 are hydrogen, the polycationic 
agent comprise repeating trimer units with the following monomer sequence (from amino 
to carboxy terminus): (1) neutral monomer, (2) neutral monomer, and (3) positively 
charged monomer. 

Preferably, the neutral monomer comprises an aromatic group at the R, position; 
more preferably, wherein the aromatic group comprises a single ring; even more 
preferably, wherein the aromatic group is a six member ring. 

Typically, the positively charged monomer is aminoalkyl at the Ky position; more 
typically, the aminoalkyl comprises 1-6 carbon molecules; even more typically, the 
aminoalkyl is aminoethyl. 

Typically, the polycationic agent comprises between 3 to 20 repeating trimers, 
trimers having two neutral and one positively charged R| groups are preferred, such as, 
for example, trimer shaving the sequence, neutral monomer, neutral monomer, positively 
charged monomer. More preferably, the polycationic agent comprises 5 to 1 8 trimers; 
preferably 8 to 16 trimers; and even more preferably, 12 to 16 trimers. 




SUBSTITUTE SHEET (RULE 26) 
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10 



15 



Optionally, the polycationic agent contains only positively charged 
monomers, excluding the terminal groups. Typically, such a polycationic agent 
comprises between 24 and 48 monomers; more typically, 30 to 40 monomers; 
even more typically, 36 monomers. 

Polycationic agents of the present invention containing only positively 
charged monomers typically have guanidinoalkyl sidechains. Typically, the 
guanidinoalkyl sidechain comprises 1 to 6 carbon molecules. Preferably, the 
side chain is guanidino ethyl. 

C. Neutral Polymers 
A preferred group of neutral polymers of the present invention have the 
general formula (I): 



Preferably, is hydrogen. Even more preferred are polymers comprising at 
least one natural annino acid. Also preferred are polymers having formula (I) 
where Rj and Rj are hydrogen, also referred to as poly N-substituted glycines or 
poly NSGs. 

Monomers employed in neutral polymers of the present invention have 
the same general formula as monomers employed in cationic polymers of the 
present invention, i.e.: 



Generally, R,, R,, and R, are moieties each with a molecular weight from 
1 to 250 daltons. More typically, the molecular weight is no more than 200; 
even more typically, no more than 1 75. 

Typically, each monomer comprises one hydrogen at R,, R,, or R3. More, 
typically, either R, and R, are both hydrogen, the structure of a L-amino acid; or 
and R3 are both hydrogen, the structure of a NSG. 





Rl R2 G 
N— C— C 
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Monomers to be utilized in the neutral agents can be either positively or 
negatively charged. Also, neutral substituents can also be utilized. Neutral 
polymers exhibit no net positive or negative charge, excluding the terminal 
groups. 

5 Degradation sites can be incorporated into the polymers by using 

naturally occuring amino acid substituents in monomers when R, and R3 are 
hydrogen. 

Naturally occurring amino acids and analogues are designated D-amino 
acids to indicate the chirality of these molecules. L-amino acids can also 
10 incorporated as monomers into the neutral polymers. The substituents of L- 

amino acids can be, for example, the same as those named for the D-amino acids. 

Preferred monomers include N-substituted glycine monomers, and 
monomers that are capable of forming hydrogen bonds and/or ionic bonds with 
the polynucleotides to be delivered. 
15 Examples of monomers for the neutral polymers include those described 

above and in the Examples below. 



D. Linking Polvmers Together 

Polymers can be linked together incorporating terminating groups or 
sidechains that permit cross-linking of the polymers. For example, polymers can 
20 be linked by a disulfide bond. Other terminating groups useful for coupling 
polymers include, carbonate, urea, and the like. 



E. Additional Groups to be Incorporated into the Polymer 
Additional components can be included in the polycationic agents of the 
instant invention, such as targeting ligands. Such additional groups can facilitate 
25 endocytosis of the desired nucleic acids or aid binding of the nucleic acids to the 
cell surface. 

Polypeptides can be incorporated into the polycationic agents. Examples 
include, without limitation: asialoorosomucoid (ASOR); transferrin; 
asialoglycoproteins; antibodies; antibody fragments; ferritin; interleukins; 
30 interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 

granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating 
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factor (M-CSF), stem cell factor and erythropoietin. Viral antigens, such as 
envelope proteins, can also be used. Also, proteins from other invasive 
organisms are useful, such as the 17 amino acid peptide from the 
circumsporozoite protein of Plasmodium falciparum known as RII. 

5 In addition, lipoproteins can be incorporated into the polycationic agent, 

such as low density lipoprotein, high density lipoprotein, or very low density 
lipoprotein. Mutants, fragments, or fusions of these proteins can also be used. 

Other groups that can be incorporated include without limitation: 
hormones, steroids, androgens, estrogens, thyroid hormone, or vitamins, folic 

0 acid. Folic acid can be incorporated into the polycationic agent according, for 
example, to Mislick et al., 1995, T.J> Bioconiugate Chem. 6i5 12. 

Also, the polycationic agents of the instant invention can be chemically 
conjugated with polyalkylene glycol. In a preferred embodiment, the 
polyalkylene glycol is polyethlylene glycol. PEG can be incorporated with a 

5 polycation agent according, for example, to Lu et aJ., 1994, Int. J. Pept. Protein 
Res. 43:127. 

In addition, the polycationic agent can be chemically conjugated with 
mono-, di-, or polysaccaride. In a preferred embodiment of this aspect, the 
polysaccharide is dextran. 
} These additional groups can be incorporated within the polycationic 

agent. For example, R,, R,, and R3 can be a substituent that is capable of being 
activated to cross link with any one of the above groups. For example, a thiol 

group could be included to cross link with another group to form a disulfide 
bond. I 

5 F. Terminal Groups 

The terminal groups of the instant polycationic agents can be chosen as 
convenient. Suitable terminal groups (i.e., Ta and Tc) include, for example, 
-NH,, -OH, -SH, and -COOH. Terminal groups can be selected to enhance the 
targeting properties of the polycationic agent and can be any of the additional 
) groups described above. 

The additional groups described above can be incorporated at the 
terminus of the polycationic agent. For example, the polycationic agent can be 

^7 
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(1) acylated with a variety of carboxylic acids; (2) sulfonyiated with sulfonyl 

chlorides; or (3) derivatized with isocyanatcs or isothiocyanates. Once activated, 

the terminus can be reacted with any of the above-mentioned groups, such as a 

polypeptide, such as low density lipoprotein, or folic acid. 
5 One means of adding a terminal group to the polycationic agent is, for 

example, is (1) to acylate the amino terminus with Fmoc-amino-hexanoic acid; 

and (2) to remove the protecting group, Fmoc, to generate a primary amine, 

which can be further functionalized. 

Altematively, the amino-terminal groups can include, without limitation: 
0 acyl, such as acetyl, benzoyl; or sulfonyl, such as dansyl. 

Carboxy terminal groups can include, for example, amide or alkyl amide. 

Synthesis of Polvcationic Agents 

Polycationic agents of the present invention can be synthesized by either 
solid or solution phase methods. The following is a solid phase method for the 
5 synthesis of NSGs, which can be generally used for a wide variety of side-chain 
substitutents. This method can be performed utilizing automated peptide 
synthesis instrumentation to permit rapid synthesis of polycationic agents of 
interest. Such instruments are conunercially available from, for example. 
Applied Biosystems and Milligen. 

0 A. Two Step Monomer Assembly 

A method of synthesis is to assemble the monomer from two 
submonomers in the course of extending a polymer comprising an NSG 
monomer. This technique is described in Zuckermann et al-» 1992, J Amer Chem 
Soc 114(26): 10646- 10647, and Zuckermann et aL, PCT Patent Publication No. 

5 WO 94/06451. The NSGs can also be considered to be an alternating 
condensation of copolymer of an acy lating agent and an amine. 

The direction of polymer synthesis with the submonomers occurs in the 
carboxy to amino direction. The solid-phase assembly for each monomer, in the 
course of polymer formation, eliminates the need for Na-protected monomers, as 

0 only reactive side-chain functionalities need to be protected. 
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Each monomer addition comprises two steps, an acylation step and a 
nucleophiiic displacement step as shown in Figure 1. 

Specifically, each cycle of monomer addition consists of two steps: 

(1) acylation of a secondary amine bound to the support with an 

5 acylating agent comprising a leaving group capable of nucleophilic displacement 
by an amine and a carbonyl group, preferably carboxyL An example is a 
haloacetic acid; and 

(2) nucleophilic displacement of the leaving group with a sufficient 
amount of a submonomer comprising a primary amino group to introduce a side- 

10 chain. The amino group containing submonomer can be an alkoxyamine, 
semicarbazide, acyl hydrazide, substituted hydrazine or the like. 

Acylation can be activitated with carbodiimide or other suitable 
carboxyiate activation method. 

The efficiency of the displacement is modulated by the choice of halide, 

15 e.g., I>CL Protection of aliphatic hydroxyl groups, carboxyiic acids, carboxy, 
thiol, amino^ some heterocycles, and other reactive side-chain functionalities is 
preferred to minimize undesired side reactions. However, the mild reactivity of 
some side-chain moieties toward displacement or acylation may allow their use 
without protection., e.g., indole, imidazole, and phenol. 

20 B. Three Step Monomer Assemblv 

NSGs can also be constructed utilizing a three step method for 
assembling each monomer as the polymer is extended. The backbone of the 
monomer of first extended by acylation step followed by a nucleophilic 
displacement. The side chain is introduced by a second acylation step. The 
reaction scheme is shown in Figure 2. 

The backbone of the monomer is assembled in the first two steps of the 
synthesis cycle. The first reaction is an acylation step where the carbonyl group 
of the acylating agent reacts with an amine. The acylating agent comprises a 
carbonyl group; a backbone, R,; and a leaving group. L. Preferably, the carbonyl 

25 group is carboxy 1. 

The second step is a nucleophilic displacement of the leaving group by 
the first amino group of the displacing agent. The displacing agent comprises a 
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first and a second amino group and a backbone, R^. The first amino group is a 
primary amine, and the second step produces a secondary amine. 

The third step is another acylation in which the another acylating 
submonomer reacts with the first amino group of the displacing agent to produce 
5 a tertiary amide. The acylation agent comprises a carbonyl group; an optional 
linker; and a sidechain. Preferably, the carbonyl group is carboxyl. 

Pharmaceutical Compositions 

The polycationic agent/pplynucleotide complexes, whether or not 
encapsulated in liposomes, may be administered in pharmaceutical compositions. 
10 The pharmaceutical compositions comprise a therapeutically effective amount of 
nucleic acids. 

The term ''therapeutically effective amount'' as used herein refers to an 
amount of a therapeutic agent sufficient to detectably treat, ameliorate, or prevent 
a particular disease or condition, i.e., an amount sufficient to induce a detectable 

1 5 therapeutic or preventative effect. The effect may include, for example, chemical 
markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature. The precise effective amount 
for a subject will depend upon the subject*s size and health, the nature and extent 
of the cardiovascular condition, and the therapeutics or combination of 

20 therapeutics selected for administration. Thus, it is not useful to specify an exact 
effective amount in advance. However, the effective amount for a given 
situation can be determined by routine experimentation and is within the 
judgment of the clinician. For purposes of the present invention, an effective 
dose will be from about 0.01 mg/ kg to 50 mg/kg or 0,05 mg/kg to about 10 

25 mg/kg of the DNA constructs in the individual to which it is administered. 

A pharmaceutical composition can also contain a pharmaceutically 
acceptable carrier. The term ''pharmaceutically acceptable carrier" refers to a 
carrier for administration of a therapeutic agent, such as antibodies or a 
polypeptide, genes, and other therapeutic agents. The term refers to any 

30 pharmaceutical carrier that does not itself induce the production of antibodies 
harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly 

so 
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metabolized macromolecules such as proteins, polysaccharides, polylactic acids, 
polygiycoiic acids, polymeric amino acids, amino acid copolymers, and inactive 
vims particles. Such carriers are well known to those of ordinary skill in the art. 
Pharmaceutically acceptable salts can be used therein, for example, 
5 mineral acid salts such as hydrochlorides, hydrobromides, phosphates, sulfates, 
and the like; and the salts of organic acids such as acetates, propionates, 
malonates, benzoates, and the like. A thorough discussion of pharmaceutically 
acceptable excipients is available in Remington 's Pharmaceutical Sciences 
(Mack Pub. Co., N.J. 1991). 

10 Pharmaceutically acceptable carriers in therapeutic compositions may 

contain liquids such as water, saline, glycerol and ethanol. Additionally, 
auxiliary substances, such as wetting or emulsifying agents* pH buffering 
substances, and the like, may be present in such vehicles. Typically, the 
therapeutic compositions are prepared as injectables, either as liquid solutions or 

15 suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles/ 
prior to injection may also be prepared. Liposomes are included within the 
definition of a pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered 

20 ( 1 ) directly to the subject; (2) delivered ex vivo, to ceils derived from the subject; 
or (3) in vitro for expression of recombinant proteins. The subjects to be treated 
can be manmials or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by 
injection, either subcutaneously, intraperitoneally, intravenously or 

25 intramuscularly or delivered to the interstitial space of a tissue. The 

compositions can also be administered into a tumor or lesion. Other modes of 
administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

30 Methods for the ex vivo delivery and reimplantation of transformed cells 

into a subject are known in the art and described in e.g.. International Publication 
No. WO 93/14778 (published 5 August 1993). Examples of cells useful in ex 
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vivo applications include, for example, stem cells, particularly hematopoetic, 
lymph cells, macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro 
applications can be accomplished by the following procedures, for example, 
dextran-mediated transfection« calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei, 
all well known in the art. 

The examples presented below are provided as a further guide to the 
practitioner of ordinary skill in the art, and are not to be construed as limiting the 
invention in any way. 

Example 1 
Synthesis of Polvcationic Agents 
This example describes the synthesis of polycationic agents with the 
following structure: 



where and are hydrogen for all monomers. All polymers describe in this 
example terminate in an amino and a carboxyl group unless specified, such as a 
folate terminating group. 

The polycationic agents described below were synthesized according to 
the procedures described in Figliozzi et al., 1996, Meth. Enzv . 267 :437*447, and 
Zuckennann et al., 1992, J. Amer . Chem. Soc > 1 14(26^:10646-10647. 

All polymers were synthesized using bromoacetic acid and primary 
amines. The following are substitutents of the primary amines to be positioned 
at Rj to construct the polycationic agents: 




5Z 
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Cationic Sidechalns 



Other Sidechairis 



r 



P « aminoethyt 




H s (S)-a-methylbenzy1 



Pf ■ aminopropyt 



H-*- c (R)^-methyfben2yl 



P* - (S>-1-metliylethylenediamine 



4^ 



Bnn benzyl 



Q « trimethyiaminoethyl 



G •= guanldinoetliyl 

jr 



Qp « ouanldinopropyt 



Y 

NH 





Ph B phenethyt 



Nm «= naphthytmethyl (R « H) 

sN -(»K»-fnethvlnaphthylmcthyJ (R -CHi) 



^'Vn. '^-Nwrrolldinopropyl 




Chm B cyciohexylmethyl 




Co 



Ff • lurfuryimethyl 




Me « methos^ethyf 



*OCHj 
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Other Sidechains 




OHbH 




. p-MeOPh - P^nethoxyphenylethyl 




OCH) 




|>OPh - p-chlorophenylethyl 



Tyr « p4iydn>xyphenylethyi 

OH 



3f 



BNSDOCID: <WO 9aoe437A2 I > 



« 



wo 98/06437 



PCT/US97/14465 



Peptofd-Fofic actd conjugates 



n 



1. Fmoo-aminohexanolc add 
2.. 20% plpeiicUne/DMF 




HPeptold 




1. 0.1 M Foac add ill OMSQ. DlC HOBU 50* 

2. Trtfhjoroaoeticacfd 



NH OH 




-PAptold 
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Abbreviation 


Description 


Bn 


benzyl 


Chm 


cyclohexylmethyl 


Ff 


furfurylmethy] 


G 


fiuanidinoethyl 


Gp 


guanidinopropyl 


H 


(S) alpha-methylbenzyl 


H+ 


(R) alpha-methylbenzyl 


Me 


me thoxy ethyl 


Nm 


naphthyimethyl 


P 


aminoethyl 


P' 


aminohexyl 


p* 


(S)-a-methylaminoeihyl 


Ph 


phenethyl 


Pr 


aminopropyl 


Py 


N-pyrrolidinopropyl 


Tmb 


3,4,5,-trimethoxybenzyl 


0 


trimethylaminoethyl 


Phpr 


phenylpropyl 


6-gal 


6-galactosyl 


Trp 


N-2-(3-indolylethyl) 


pMeOph 


p-methoxyphenethyl 


pClPh 


p-chlorophenethyl 


Tyr 


p-hydroxyphenethyl 


sN 


(S )-a-methy Inaphthy Imethyl 



The polycationic agents synthesized include: 



Name 


Sequence 


Length 


Mol. 
Wt. 


# charges 


RZllO-1 


(HHP)6 


18 


2550.8 


7 


RZl 10-2 


(HP)9 


18 


2367.8 


10 


RZllO-3 


(HPP)6 


18 


2184.8 


13 


RZl 10-4 


(HPPPHHP 


18 


2123.8 


14 


RZliO-5 


(HHF)6 


18 


2869.8 


7 


RZl 10-6 


(HF)9 


18 


2871.8 


10 


RZl 10-7 


(HFP')6 


18 


2856.8 


13 


RZl 10-8 


(HPPPHHP' 


18 


2851.8 


14 


RZl 10-9 


(HHP) 12 


36 


5084.6 


13 


RZl 10-10 


(HP) 18 


36 


4718.6 


19 


RZllO-11 


(HPP)12 


36 


4352.6 


25 


RZllO-12 


PP(HPPP)8HP 


36 


4230.6 


27 


RZl 10-13 


(HHP') 12 


36 


5722.6 


13 


RZl 10-14 


(HP") 18 


36 


5726.6 


19 


RZllO-15 


(HP'P)12 


36 


5696.6 


25 



5(p 
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Mnl 

Wt 
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fPhDrPhDrPU2 
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5419 




RZ 140-2 


(6-Kal)12-(PhPhP)12 


48 


7712 


13 


RZ 140-3 


(TrpTrpP)12 


36 


6020 


13 


RZi44-l 


(PhPPh)12 


36 


5083 


13 


RZ 144-2 


(PPhPh)12 


36 


5083 


13 


RZ 144-3 


(pMeoPhpMeoPhP) 


36 


5803 


13 
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Name 


Sequence 


Length 


Mol. 
Wt. 


# charges 




^^pv-,lr np\-.Jr rir } 


jO 


lU 


I J 


RZ144-5 


AMCA-(PhPhP)12 


36 


5411 


12 


RZ 144-8 


(TyrTyrP)12 


36 


5467 


12 


RZ144-12 


(6gal6ga]P)12 


36 


6475 


13 


*RZ145-1 


(PhPhP)12 


36 


5085 


13 


RZ147-2 


(PpMeOPhpMeOPh) 1 2 


36 


5805 


13 



*purified 



To summarize the method, Fmoc-Rink amide resin (NovaBiochem, San 
Diego, California, USA) is used as the solid support. This is the same resin that 
is used for the Fmoc synthesis of peptide C-terminal amides. The poiycationic 
5 synthesis begins with the deprotection of the Fmoc group on the resin with 20% 
(v/v) piperidine-dimethylformamide (DMF). The amino resin is then acyiated 
with bromoacetic acid. This is followed by nucieophilic displacement of the 
bromide with a primary amine to build the NSG monomer. The latter two steps 
are then continued in an iterative fashion to elaborate the desired oligomer. 

10 All reactions and washings were performed at room temperature unless 

otherwise noted. Washing of the resin refers to the addition of a wash solvent 
(usually DMF or dimethylsulfoxide (DMSO)) to the resin, agitating the resin so 
that a uniform slurry is obtained (typically for about 20 seconds), followed by 
thorough draining of the solvent from the resin. Solvents were removed by 

15 vacuum filtration through the fritted bottom of the reaction vessel until the resin 
appeared dry (typically about 5 seconds). In all the syntheses, resin slurries were 
agitated via bubbling argon up through the bottom of the fritted vessel. 

A fritted reaction vessel was charged with 100 mg (50 fxmol) of Fmoc- 
Rink amide resin with a substitution level - 0.50 mitH/g resin. Two milliliters of 

20 DMF was added to the resin and this solution was agitated for 1-2 minutes to 

swell the resin. The DMF was then drained. The Fmoc group was then removed 
by adding 2.0 ml of 20% piperidine in DMF to the resin. This was agitated for 1 
minute and then drained. Another 2 mi of 20% piperidine in DMF was added to 
the resin and agitated for 15 minutes and then drained. The resin was then 

25 washed with DMF, six times with 2 ml. 

The deblocked amine was then acyiated by adding 850^1 of 0.6 M 
bromoacetic acid in DMF to the resin followed by 200 ^l of 3.2 M N,N'- 
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10 



15 



20 



25 



diisoproopiycarbodiimide (DlC) in DMF. This solution was agitated for 30 
minutes at room temperature and then drained. This step was repeated a second 
time. The resin was then washed with DMF, twice with 2 ml and DMSO, once 
with 2 ml. This completed one reaction cycle. 

The second cycle was initiated by the acylating step with bromoacetic 
acid and DIC, followed by displacement with the second amine. This 
acylation/displacement cycle was repeated until the desired oligomer was 
obtained. 

Cleavage of the resin from the polycationic agent is as follows. The dried 
resin was placed in a glass scintillation vial containing a teflon-coated micro stir 
bar, and approximately 5 ml of 95% trifluoroacetic acid (TEA) in water was 
added. The solution was stirred for 20 minutes and then filtered through an 8-ml 
solid-phase extraction (SPE) column fitted with a 20-|im polyethylene frit into a 
50 ml polypropylene conical centrifuge tube. 

The resin was washed with 1 ml 95% TFA. The combined filtrates were ' 
then lyophilized three times from 1 : 1 acetonitrilerwater. Material was 
redissolved to a concentration of 5 mM in 5% acetonitrile in water. 

Preparation of puanidinoalkvl- containing Polymers : 

The guanidinoalkyl sidechains were introduced into the polymers by 
post-synthesis modification of aminoalkyl sidechains. Thus, polymers were 
synthesized by the submonomer method as described above except that 
methoxybenzhydrylamine (MBHA) resin was used instead of the Rink resin. 
Wherever a guanidinoalky sidechain was desired, a mono-Boc-alkanediamine 
was incorporated in the displacement step. After elaboration of the polymers, the 
sidechain Boc groups were removed by treatment with 95% TF A/water for 20 
min at room temp. (This does not remove the oligomer from the soliud support). 
The free amino groups were then guanidinylated by treatment with 1 H-pyrazole- 
l-carboxamidine (1 M in DMF, 2 x 1 hr, 40*^0). After washing with DMF and 
methylene chloride, the oligomer was cleaved from the resin with hydrofluoric 
acid, and lyophilized. 
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Preparation of folic acid - polvmer conjugates : 

Folic acid - polymer conjugates were prepared by adding a linker to the 
N-terminus of the resin-bound polymer which was then acylated with folic acid. 
Specifically, after elaboration of the polymer, the N-terminus was acylated with 
Fmoc-aminohexanoic acid (0.5 M in DMF, 0.5 M hydroxybenzotriazole, 0.5 M 
5 diisopropylcarbodiimide (DIG), 1 x 1 hr, room temp.). After Fmoc group 

removal (20% pipcridine/DMF, 1 x 20 min, room temp.), the free primary amino 
group was acylated with folic acid (0. 1 M in DMSO, 0. 1 M DIG, 1 x 2 hr, 50°C). 
After washing of the resin, the conjugate was cleaved with 95% TF A/water in 
the usual fashion. 



10 Example 2 

Gondensation of Polvnucleotides 
Polycationic agents were synthesized and isolated to a final concentration 
of 5 mM as described in Example 1 . Polynucleotides were condensed with 
RZl 10, RZl 12, and RZ120 series compounds according to the following 
15 procedure. 

(1) Dilute all polycationic agents to a final concentration of 3 
nanomoles of positive charge per microliter. 

(2) Add I |xg of DNA to 1-2^1 of diluted polycationic agents. 

(3) Adjust volume to 10 |il. This mixture can be stored overnight at 

20 4'^G. 

(4) Add of 5 ill of DNA/polycationic mixture to 2 ^1 of 5X buffer, 
which does not contain SDS to maintain the complex. (5X buffer = 40% sucrose, 
0.25% bromphenol blue and 200 mM Tris Acetate, 4 mM EDTA (PH 7.8). 

(5) Adjust volume to 10 ^1. 

25 (6) Run sample on a 1% agarose gel utilizing 75 volts for 1.5 hours.. 

Between 1 to 2 ^1, all polycationic agents were judged to retard the 
migration of DNA into an agarose gel. 



60 
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Example 3 
Inhibition of Scrum Degradation 
The RZl 10, RZl 12, and RZ120 series compounds were mixed with 
polynucleotide as described in Example 2. Five microliters of the overnight 
mixture was added to 5 jai of BalbC mouse serum. The serum was not heat 
5 treated but freeze thawed. The serum, poiycationic agent, and polynucleotide 
mixture was incubated typically for 30 minutes at ST'^C. The time of incubation 
varied between S and 60 minutes 

Next, 2 ^l of 5X buffer containing 0.5% (wt/v) SDS was added to the 
incubated mixture. This fmal solution was loaded onto a 1% agarose gel and 
10 electrophoresed at 75 volts for 1.5 hours. 

All of the compounds tested, i.e., the entire RZl 10, 1 12, and 120 series, 
provided significant protection in a direct comparison. The entire RZl 12 series 
and RZl 10-3 and RZl 10-8 inhibited serum degradation better than poly-L- 
lysine. 

15 Example 4 

Pcptoid Mediated In Vitro Delivery 
DN A comprising a luciferase gene 1 |ig/)^, was diluted into endotoxin 
free water. The plasmid DNA was CMVKm luciferase, which is described in 
more detail in Example 5. 
20 The transfection protocol for in vitro delivery was as follows: 

(A) HT1080 cells were used. These cells are available from American 
Type Culture Collection, Rockville. Maryland, USA, Accession No. CCL 121. 
This is a fibrosarcoma. The growth medium was Dulbecco's Modified Eagle 
medium (DME) with 10% heat-inactivated fetal calf serum. 

25 (B) Twenty four hours prior to transfection, the cells were placed at 5 X 

10** per well of a 24- well plate in 1 ml of medium. 

1 . Feed cells with 500^1 of DME- 10% fetal calf serum (PCS )or 500^1 

Opti-MEM®. Opti-Mem® can be purchased from Gibco BRL, Life 

Technologies, Inc., Gaithersburg, Maryland, USA. 
30 2. Add 200 ^il Opti-MEM® to each tube. 

(or 
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3. 

MEM®. 



Add 3 fil of the desired polycationic agent to the 200^1 of Opti- 



4. Add 2 |Lil of l^g/)il luciferase DNA, mix. 

5. Incubate mixture for 5 minutes at room temperature. 

5 6. Add 100 ^l of the polycationic agent/DNA mixture to plate with 

DME-FCS, 100 ^il to cells fed with Opti-MEM®. 

7. Incubate cells and polycationic agent/DNA mixture for -4 hours at 

37°C. 

S. Change media on all cells to DME*FCS. 
10 9. DME-FCS was used as a positive control. 

As a control, a transfectant, LTl, was used from Panvera, Inc., Madison, 
Wisconsin, USA to transfect cells in serum and cells in Opti-MEM®. 

10. Cells were tested for luciferase activity using a Promega Luciferase 
Assay System from Promega, Madison, Wisconsin, USA., in accordance with 
15 the manufacturer s directions. 



Results: 



Name 


Formula 


Luciferase 
(RLU) 


RZ120-1 


(MeMeP)8 


0 


RZ120-2 


{BnBnP)8 


0.93 


RZI20-3 


(HHP)8 


1.38 


RZ120-4 


(H4-H+P)8 


1.5 


RZ 120-5 


(MeMeP)12 


0 


RZ120-6 


(BnBnP)12 


1.64 


RZ 120-7 


(HHP) 12 


2.64 


RZ120-8 


(H+H-fP)12 


2.84 


RZ120-9 


(MeMeP)16 


0 


RZ120-10 


(BnBnP)16 


1.42 


RZ120-n 


(HHP) 16 


1.94 


RZ120-12 


(H-hH+P)16 


1.32 


control 


LTl 


51.96 



/ ^ 

to/- 



BNSDCXJID: <WO 9806437 A2J_> 



wo 98/06437 PCT/US97/14465 



Experiment #2 




Name 


Formula 


LtUciferasc 
(RLU) 


RZllO-1 


(HHP)6 


0.0015 


RZl 10-2 


(HP)9 


0.0012 


RZl 10-4 


(HPPP)4HP 


0.0004 


RZl 10-5 


(HHF)6 


0.0006 


RZllO-6 


(HF)9 


0 0052 


RZl 10-7 


(HP*P')6 


0 0005 


RZl 10-8 


(HFPPHHF 


0 0003 


RZl 10-9 


(HHP) 12 


8.7 


RZl 10-10 


(HP) 18 


0.0014 


RZllO-12 


PP(HPPP)8HP 


0.0459 


RZllO-13 


(HHP) 12 


2.5 


RZllO-14 


(HF)18 


2.2 


RZl 10-15 


(HP'F)12 


0.064 


RZl 10-16 


PP'(HP'FP)8 


0.01 


control 


LT 1 


88.7 



63 
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Example 5 
Targeting Ligand 
A, Cells, Vector, and Compositions Used. 

In a first experiment, murine endothelial cells (Py-4-1) which express 
5 high levels of acetylated-LDL receptors. The cells and the LDL receptors are 
described in Dubois et al., 199 1, Exp. Cell Res . 196:302-3 13. 

A luciferase-containing plasmid (pCMVkmLUC) was used to determine 
if polynucleotides could be delivered and expressed into endothelial cells when 
associated with polycationic agents described in Example 1 with acetylated-LDL 
10 (Ac-LDL). A description of the identification and isolation of endothelial cells 
based on their increased uptake of acetylated-low density lipoprotein is in Voyta 
et al., 1 984, L Cell Biol . 99: 2034-2040. 

The plasmid used in these experiments pCMVkmLUC, was constructed 
by inserting the luc + gene from pSP-luc+ (Promega Corporation, Madison, WI) 
15 into the expression vector pCMVkm2. Briefly, pSP-luc+ was digested with the 
restriction enzymes Nhel-EcoRV (Boehringer Mannheim, Indianapolis, IN) and 
a fragment of 1691 bp was isolated by standard methods. This fragment was 
inserted into pCMVkin2, which had been digested with Xbal and EcoRV using 
the Rapid Ligation Kit (Boehringer Mannheim, Indianapolis, IN). The sequence 
20 of pCMVkm2 is depicted in SEQ ID NO:2 and described below. The luc -H gene 
was cloned into pCM Vkm2 such that expression is driven by the CMV 
immediate early enhancer promoter and terminated by the bovine growth 
hormone polyadenylation signal. 

The luciferase expression was compared to levels obtained with the same 
25 vector delivered in conjunction with iipofectannine, an agent used commonly to 
transfect cells in vitro (Hawley-Nelson et al., 1993, Focus 15:7 3). The results are 
presented in the table below. 



B. Method of Transfection : 

Briefly, the cells were plated in 24 well dishes, groMfn to approximately 
30 80% confluence, transfected and assayed 24 hours later for luciferase activity. 
All transfections were done in serum containing medium. During transfection 
mixture preparation, pCMVkmLUC was first mixed with RZ 1 12, and the DNA 

6f 
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caiionic polycationic agent complexes were then added to Ac-LDL. Serum 
containing medium was then added to the mixtures to adjust the voiume 
delivered to each well to 0.5 ml. 

Lipofectamine was used as a positive control. No lipoprotein was added 
5 to this positive control. Lipofectamine is a 3: 1 (w/w) liposome formulation of 
the polycationic lipid 2,3.-dioleylosy-N-[2(spermine-carboxamido)ethyl]-N,N- 
dimethyl-l-propanaminiumtrifluoroacetate (DOSPA) and the neutral lipid 
dioleoyl phosphatidyl-ethanoiamine (DOPE) in membrane-filtered water. 
Lipofectamine can be purchased from Life Technologies, Gaithersburg» 
10 Maryland, USA). 



C. Luciferase Assay 

Luciferase activity was assayed using the Promega Luciferase Assay 
System, Madison, Wisconsin. 



15 D. Results 

Table 1 shows the results of an experiment where the polycationic agent, 
RZ-1 12-2 was compared to lipofectamine to deliver the luciferase gene to cells 
comprising the acetylated LDL receptor. 

Table 1 
LuciferaseActivitv 





Ac-LDL 


pCMVkmLUC 


RZ 1 12-2 


pg luc/mg 


Group 


(MB) 


(MR) 


(nm) 


Drotein 


1 


5 


10 


2.5 


61 


2 




10 




0 


3 




10 


2.5 


16 


4 


0.5 


1 


0.25 


16 


5 




1 


0.25 


17 


6 


0.5 


10 


2.5 


631 


7 


0.5 


10 


5 


1996 



LIPOFECTAMINE CONTROL 

8 10 : 10786 

* each number represents the mean of three wells. 
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Example 6 

Comparison of Cells with and without Acetvlated LDL Receptors 

A. Cells with Acetvlated LDL Receptors 

For this experiment, K1735 mouse, epithelial melanoma cells were used. 
5 These cells express low or non-existent levels of Ac-LDL receptors. A 
description of the cells is in J. Natl . Cancer Inst . 69(4): (1982). 

B. Methods 

Briefly, the cells were plated in 24 well dishes at 10,000 cells per well in 
10 DME with 10% PCS supplemented with 2 mM L-glutamine. The Py-4-1 cells 
were cultured in 10% CO^ at 37°C. The K1735 cells were cultured in 5% CO, at 
37°C. The cells were grown to approximately 50% confluence, transfected and 
assayed 24 hours later for luciferase activity. All transfections were done in 
serum containing medium. 
15 During transfection mixture preparation, pCMVkmLUC was first mixed 

with RZ 1 12-2, and the DNA-polycationic agent complexes were then added to 
Ac-LDL. Serum containing medium was then added to the mixtures to adjust the 
volume delivered to each well to 0.5 ml. 

C. Results 



Table 2 

Luciferase Activitv 



Group 


Ac-LDL 


pCMVkmLUC 


RZ 112-2 


pg luc/mg protein* 




(Mg) 


(Mg) 


(nm) 


Py-4-1 


K1735 


1 


0.5 


1 


1 


1301 


24 


2 


0.5 


1 


5 


2181 


0 


3 


0.5 


1 


10 


373 


0 


4 


0.5 


10 


5 


840 


0 


5 




1 


5 


327 


0 


6 




1 


5 


945 


ND 


10 


5 


1 


5 


298 


ND 


UPOFECr AMINE CONTROL 








7 




1 




23 


0 


8 




10 




2878 


960 



* each number represents the mean of three wells. 
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Example 7 

Iniection of Polynucleotides Encoding Erythropoietin 

A. Polynucleotides 

CMVlan2 is the standard vector used in these studies, CMVIan2 is a 
yeclor optimized for expression in mammalian cells. The gene of interest is 
5 cloned into a polylinker which is inserted 3' of a human CMV expression 
cassette. This cassette contains the human CMV inunediate early 
promoter/enhancer followed by intron A of the human CMV immediate early 
region (Chapman el al., 1991 , Nucl. Acids Res . 19:3937-3986). Transcription is 
terminated by a polyadenylation site from the boyine growth hormone gene, 

10 which has been cloned inunediately 3' of the polylinker. See SEQ ID NO:2 for 
the CMVkm2 yector. 

The CMV-km-cmEPO yector was constructed from CMVkm2 as 
follows. The cynomolgus monkey EPO cDNA was acquired from the ATCC 
(Accession No. 67545, Rockyille, MD). This plasmid was cut with Avrll and 

15 Bglll and inserted into the Xba l and BamH I sites of the CMVkm2 yector. The 
inserted sequence contains the entire coding region of cmEPO (Genbank 
accession M 1 8 1 89). See SEQ ID NO:3. 

B. Mice 

Inununodeflcient seyere combined immunodeficiency (SCID) mice were 
20 obtained from Charles Riyer Labs, Wilmington, Massachusetts, USA. 

Intramuscular injections were performed as follows: mice were 
anaesthetized with SO |li1 of a solution which contained 80 mg/ml ketamine and 4 
mg/ml of xylazine. The area surrounding the anterior tibialis muscle was shayed. 
Fifty ^l of DN A, at a concentration of 2.7 ug/^l in .9% saline solution was 
25 injected into the anterior tibialis muscle of both legs using a 28 gauge needle. 
Twenty-four hours after the first injection, a second injection was performed 
using the identical protocol. Blood was taken from the orbital sinus to determine 
hematocrits on a weekly basis. 
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C. Result 

The hematocrit readings on 6 mice which were injected with plasmid, 
CMVkm-cmEpo (which expresses the cynomolgus monkeys EPO cDNA), are 
shown in Table 3 below. The row marked control shows the average reading for 
5 three uninjected mice. The raw data for the three control mice is shown in the 
lower part of Table 3. Mouse 2 in the injected group died between 4 and 5 
weeks post-injection. 
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Example 8 

Injection of Polynucleotides Encoding Leptin 

A. Polvnucieotides 

5 The CMV-km2 vector, described above, was used for these experiments. Either the 

wild-type or HA version of the leptin coding region v^^as inserted into the vector. The map of 
the plasmid is depicted in Figure 4 and the sequence of the vector with the wild type leptin is 
shown in SEQ ID NO:4. 

B. Mice 

*0 Ob/ob mice were obtained from Jackson Labs, Bar Harbor, Maine, USA. The first of 

the recessive obesity mutations, the obese mutation {ob) was identified and described in 1950 
by Ingall et al., 1950, J. Hered. 41:317-318. Subsequently, 5 single-gene mutations in mice 
have been observed to produce an obese phenotype, as described in Friedman et a]., 1 990, 
Cell 69:217-220. (More recently, the mouse obese gene and its human homologue have been 

15 cloned, as described in Zhang ct al * 1994, Nature 372:425). 

C. Method 

Intramuscular injections were performed as follows: mice were anaesthetized with the 
same ketamine solution described above in the Example 7 and the area surrounding the 
anterior tibialis muscle was shaved. 
20 Fifty microliters of DNA at a concentration of 3.3. ^tg/^l in 0.9% saline solution was 

injected into the anterior tibialis muscle of both legs using a 28 gauge needle. 

Seventy-two hours after the first injection, a second injection was performed using the 
identical protocol. 

Group 1 ob/ob mice were injected with a plasmid (CMVkM leptin-wt) which encodes 
25 the wild-type mouse leptin protein. 

Group 2 ob/ob mice were injected with a plasmid (CM Vkm-leptinHA) which encodes 
a form of mouse leptin which is modified with the epitope which is recognized by the 
antibody 12CA5. The amino acid sequence of the epitope is SYPYDVPDYASLGGPS 
(Wilson et al„ 1 984, Cell 37: 767-778). 
30 Group 3 ob/ob mice were injected with a solution of 0.9% saline. 
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The mice were weighed each day (see Table 4) and the proponional weight gain for 
each mouse during the first eight days was calculated. The results are shown in Table 5. For 
any given day, the weight was subtracted from the weight of the individual mouse on day 0, 
5 and the difference was divided by the weight on day 0. The proportional weight change data 
from day 8 was analyzed using an unpaired t-lest. When compared with group 3 control mice 
the p value from group 2 mice was 0.004. When compared with group 3 control mice, the p 
value for group 1 mice is 0.0038. 

Note: the mice were not weighed on day 1 and day 2, the values for these days were 
10 extrapolated from day 3. 
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Table 5 



Proportional Change in Weight from Day 0 of Mice Injected with cDNA for Leptin 
(gpl), Lepiin-HA (fip2) or Saline 




Group 1 Mice Below Injected with CMVkM-Leptin-wt 




day 0 


day i 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


day 8 


mouse 1 


0 


0.009 


0.018 


0.028 


0.028 


0.059 


0.059 


0.059 


0.066 


mouse 2 


0 


0.013 


0.026 


0.039 


0.039 


0.085 


0.085 


0.085 


0.085 


mouse 3 


0 


0.006 


0.012 


0.018 


0.018 


0.062 


0,062 


0.062 


0.062 


mouse 4 


0 


0.014 


0.028 


0.042 


0.042 


0.065 


0.065 


0.065 


0.065 


mouse 5 


0 


0.007 


0.014 


0.021 


0.021 


0.042 


0.042 


0.042 


0.06 




0 


0.007 


0.014 


0.021 


0.021 


0.041 


0.041 


0.041 


0.061 






















Group 2 Mice Below Injected with CMVkM-LeptinHA 




day 0 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


day 8 


mouse 1 


0 


0.006 


0.013 


0.02 


0.02 


0.06 


0,06 


0.06 


0.06 


mouse 2 


0 


0.013 


0.031 


0.046 


0.023 


0.046 


0.046 


0.046 


0.046 


mouse 3 


0 


0.007 


0.014 


0.021 


0.021 


0.04 


0.04 


0.04 


0.062 


mouse 4 


0 


0.006 


0.012 


0.02 


0.02 


0.06 


0.04 


0.04 


0.06 


mouse 5 


0 


0.014 


0.028 


0.043 


0.065 


0.065 


0.065 


0.08 


0.08 






















Group 3 Mice Below Injected with Saline 




day 0 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


day 8 


mouse ] 


0 


0.016 


0.032 


0.05 


0.05 


0.06 


0.06 


0.125 


0.125 


mouse 2 


0 


0,07 


0.014 


0.021 


0.04 


0.04 


0.06 


0.08 


0.08 


mouse 3 


0 


0.013 


0.026 


0.04 


0.04 


0.08 


0.08 


0.1 


0.1 


mouse 4 


0 


0.02 


0.04 


0.06 


0.06 


0.08 


0.08 


0.1 


0,12 


mouse 5 


0 


0.015 


0.03 


0.045 


0,069 


0.09 


0.12 


0.14 


0.14 


Average Proportional change in Weight for Each Group 




day 0 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


dav 8 


group 1 


0 


0.009 


0.018 


0.028 


0.028 


0.059 


0.059 


0.059 


0.066 


group 2 


0 


0.009 


0.018 


0.028 


0.029 


0.054 


0.05 


0.053 


0.061 


group 3 


0 


0.014 


0.028 


0.042 


0.051 


0.07 


0.08 


0.109 


0.113 
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Example 9 

Peptoid Mediated In Vitro Delivery in COS. HT1080. and 293 Cell Line^; 
COS cells (available from the American Type Culture Collection, Rockville, MD, 
under Accession No. CRL 1651 and HT1080 cells (available from the American Type 
Culture Collection, Rockville, MD, under Accession No. CCL 121) were cultured and 
5 transfected with pCMVkmLUC and various polycationic agents of the present invention 
(described in Example 1) according to the transfection protocol described in Example 4. 
Luciferase activity was assayed according to the method described in Example 4. Total cell 
protein was measured using a Pierce BCA kit according to manufacturer's directions. 

The results, shown in Figure 7A, indicate that the ability of the polycationic agents to 
10 mediate transfection is not dependent on cell line type. Polycationic agents having a 
repeating trimer motif of neuu-al and cationic sidechains were particularly effective at 
mediating transfection. 

Transfection efficiencies for a homologous series of cationic peptoids were evaluated. 
Specifically, cationic peptoids RZ-1 10-1 (18-mer). RZ-120-3 (24mer), RZ120-7 (36mer), and 
15 RZ 120-1 1 (48mer), which have the same repeating (HHP) motif were evaluated for their 
ability to transfect COS and HT1080 cells. These polycationic agents were complexed with 
pCM VkmLUC at a 2: 1 , + to - charge ratio. The concentration of negative charges on DNA 
was calculated using 3.03 nmol of phosphate per 1 ^g of DNA, on the basis of the average 
molecular weight of 330 for each nucleotide. The formula weight of the polycationic agent 
20 was calculated as a semi-trifluoroacetate salt (50% of amino groups form salt with TFA), and 
the concentration of the polycationic agent was determined on the basis of the weight of the 
lyophilized peptoid. Amino groups were formally considered to be fiiUy protonated to obtain 
the number of positive charges on the polycationic agent interest when calculating the + to - 
charge. 

25 As shown in Figure 7B, transfection efficiencies for this particular series of cationic 

peptoids were largely independent of oligomer length for peptoids having 24 or more 
monomeric units. 

Transfection efficiencies using polycationic agent RZ 145-1 and commercially 
available cationic lipids, DMRIE-C'^, Lipofectin* and lipofectamine were evaluated. In these 
30 experiments RZ145- 1 was complexed with pCMVkmLUC at a 2: 1 , + to - charge ratio. 
Transfection with DMREE-C'" Lipofectin®, lipofectamine was conducted according to 
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manufacturer's directions. The cationic lipids were also employed at a 2: 1, + to - charge ratio. 
293 human embryonic kidney cells (Microbix, Toronto, Ontario, Canada), HT1080 cells, and 
NIH-3T3 cells (available from the American Type Culture Collection, Rockville, MD, 
Accession No. CRL 1658) were transfected, cultured either in the presence or absence of 10% 
5 serum, then assayed for luciferase production using the same protocol as described in 
Example 4. Luciferase was measured, as described in Example 4, 48 hours after initial 
transfection. Total cell protein was measured using a Pierce BCA kit according to 
manufacturer's directions. 



10 lipofectamine, which were respectively 10- and 100-fold less efficient in the presence of 

serum, gene transfer mediated by polycationic agent RZ145-1 was insensitive to the presence 
of serum. 

Transfection mediated by polycationic polymers, such as polylysine and histones, is 
greatly enhanced by addition of chloroquine to the transfection media. To determine whether 

15 chloroquine affected transfection mediated by polycationic agents of the present invention, 
HT1080 and 293 cells were transfected using RZ14S-1 in the presence and absence of 
chloroquine. As a control, the same cell lines were transfected with polylysine both in the 
presence and absence of chloroquine. The results, shown in Figure 9, indicate that the 
polycationic agent RZ145-1 was equally effective at mediating transfection both with and 

20 without chloroquine. In contrast, polylysine-mediated transfection in the absence of 

chloroquine was 100-fold lower than polylysine mediated transfection in the presence of 
chloroquine. In addition, the results indicate that cationic peptoid mediated transfection is 
more efficient than polylysine mediated transfection. 



A. DN A/Pol vcatinic Agent Complex Formation (2: L to - Change Ratio) 

All operations were carried out at ambient temperature. DGPW (diagnosis grade 
purified water) was used to prepare the stock solutions. Both the plycatinic agent and DNA 
samples had low salt concentrations (i.e., < ImM) to avoid precipitation. 



The results, shown in Figure 8, indicate that, in contrast to Lipofectin* and 



25 



Example 10 

Preparation of a Stable Formulation of DNA/Polvcationic Agent Complex 
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(I) Batch Method 

Complexes of polycationic agent RZ145-1 and pCMVkmLUC, as follows, for up to 
250 fig DNA. DN A (i.e., pCMVkmLUC) was diluted with 30% (v/v) elhanol in water to a 
concentration of 50 |ag/mi corresponding to 151 jiM of negative charge. RZ 145-1 was 
5 diluted to 23.2 |xM in 30% ethanoi in water. To 1 part of the polycationic agent solution was 
added 1 part of DNA solution as quickly as possible with gentle agitation. The DNA solution 
was added to the solution of polycationic agent (rather than vice-versa) to avoid precipitation. 
Slow addition of the two solutions was avoided to avoid precipitation and the formation of 
large complexes. 

10 (2) Continuous Method 

For more than 250 jig of DNA, a continuous method for preparing concentrated 
formulations of polycationic agent/DNA complex is preferred. The DNA and peptoid 
solutions were prepared as above and placed into separate bottles. Each bottle was connected 
to one port of a mixing tee. The bottles were simultaneously pressurized with 2 to 3 psi to 

15 deliver the two streams to the mixing tee at the same flow rate (e.g., 20 ml/min or higher). 

B. Concentration Step 

Two milliliters of the DN A-polycationic agent complex from part A was placed in a 
Centricon*-100 (Amico Inc. Beverly, MA), and centrifuged at 1000 x g for 30 nninutes or 
until the volume of the retentate containing polycationic agentDNA complex was 

20 approximately 50 The filtrate was removed from the bottom receiver. The retentate was 
diluted with 2 ml of 5% glucose, and concentrated to 50 |il again. This operation was 
repeated again to produce a concentrated complex solution containing I mg/ml DNA in 5% 
glucose. This concentration step can be conducted at either 4*'C or at ambient temperature. 
The ethanoi content of the final concentrated solution was less than 0.1 %. No precipitation 

25 was observed in the concentrated solution. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT; Zuckermann et al . 

(ii) TITLE OF INVENTION: Compositions and Methods for 

Polynucleotide Delivery 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE; Chiron Corporation 
<B) STREET: 4560 Horton Street 

(C) CITY: Emeryville 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 94608-2916 

(V) COMPUTER READABLE FORM: 

<A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Fujita, Sharon M. 

(B) REGISTRATION NUMBER: 3 8,4 59 

(C) REFERENCE /DOCKET NUMBER: 1218.002 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (510) 923-2706 

(B) TELEFAX: (510) 655-3542 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GCGGCCGCGG AATTCTCATG TTTGACAGCT TATCATCGAT AAGCTGATCC TCACAGGCCG 60 

CACCCAGCTT TTCTTCCGTT GCCCCAGTAG CATCTCTGTC TGGTGACCTT GAAGAGGAAG 120 

AGGAGGGGTC CCGAGAATCC CCATCCCTAC CGTCCAGCAA AAAGGGGGAC GAGGAATTTG 180 

AGGCCTGGCT TGAGGCTCAG GACGCAMTC TTGAGGATGT TCAGCGGGAG TTTTCCGGGC 240 

TGCGAGTA^T TGGTGATGAG GACGAGGATG GTTCGGAGGA TGGGGAATTT TCAGACCTGG 300 

ATCTGTCTGA CAGCGACCAT GAAGGGGATG AGGGTGGGGG GGCTGTTGGA GGGGGCAGGA 360 

GTCTGCACTC CCTGTATTCA CTGAGCGTCG TCTAATAAAG ATGTCTATTG ATCTCTTTTA 420 

GTGTGAATCA TGTCTGACGA GGGGCCAGGT ACAGGACCTG GAAATGGCCT AGGAGAGAAG 480 

GGAGACACAT CTGGACCAGA AGGCTCCGGC GGCAGTGGAC CTCAAAGAAG AGGGGGTGAT 540 

AACCATGGAC GAGGACGGGG AAGAGGACGA GGACGAGGAG GCGGAAGACC AGGAGCCCCG 600 

GGCGGCTCAG GATCAGGGCC AAGACATAGA GATGGTGTCC GGAGACCCCA AAAACGTCCA 660 

AGTTGCATTG GCTGCAAAGG GACCCACGGT GGAACAGGAG CAGGAGCAGG AGCGGGAGGG 720 

77 
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GCAGGAGCAG GAGGGGCAGG AGCAGGAGGA GGGGCAGGAG CAGGAGGAGG GGCAGGAGGG 780 

GCAGGAGGGG CAGGAGGGGC AGGAGCAGGA GGAGGGGCAG GAGCAGGAGG AGGGGCAGGA 840 

GGGGCAGGAG GGGCAGGAGC AGGAGGAGGG GCAGGAGCAG GAGGAGGGGC AGGAGGGGCA 900 

GGAGCAGGAG GAGGGGCAGG AGGGGCAGGA GGGGCAGGAG CAGGAGGAGG GGCAGGAGCA 960 

GGAGGAGGGG CAGGAGGGGC AGGAGCAGGA GGAGGGGCAG GAGGGGCAGG AGGGGCAGGA 1020 

GCAGGAGGAG GGGCAGGAGC AGGAGGGGCA GGAGGGGCAG GAGGGGCAGG AGCAGGAGGG 1080 

GCAGGAGCAG GAGGAGGGGC AGGAGGGGCA GGAGGGGCAG GAGCAGGAGG GGCAGGAGCA 1140 

GGAGGGGCAG GAGCAGGAGG GGCAGGAGCA GGAGGGGCAG GAGGGGCAGG AGCAGGAGGG 1200 

GCAGGAGGGG' CAGGAGGAGG AGGGGCAGGA GGGGCAGGAG CAGGAGGAGG GGCAGGAGGG 1260 

GCAGGAGCAG GAGGAGGGGC AGGAGGGGCA GGAGCAGGAG GGGCAGGAGC GGCAGGAGCA 13 20 

GGAGGGGCAG GAGGGGCAGG AGCAGGAGGG GCAGGAGGGG CAGGAGGAGG AGGAGGGGCA 13 80 

GGAGCAGGAG GGGCAGGAGC AGGAGGTGGA GGCCGGGGTC GAGGAGGCAG TGGAGGCCGG 1440 

GGTCGAGGAG GTAGTGGAGG CCGGGGTCGA GGAGGTAGTG GAGGCCGCCG GGGTAGAGGA 1500 

CGTGAAAGAG CCAGGGGGGG AAGTCGTGAA AGAGCCAGGG GGAGAGGTCG TGGACGTGGA 1560 

GAAAAGAGGC CCAGGAGTCC CAGTAGTCAG TCATCATCAT CCGGGTCTCC ACCGCGCAGG 1620 

CCCCCTCCAG GTAGAAGGCC ATTTTTCCAC CCTGTAGGGG AAGCCGATTA TTTTGAATAC 1680 

CACCAAGAAG GTGGCCCAGA TGGTGAGCCT GACGTGCCCC CGGGAGCGAT AGAGCAGGGC 17 40 

CCCGCAGATG ACCCAGGAGA AGGCCCAAGC ACTGGACCCC GGGGTCAGGG TGATGGAGGC 1800 

AGGCGCAAAA AAGGAGGGTG GTTTGGAAAG CATCGTGGTC AAGGAGGTTC CAACCCGAAA 1860 

TTTGAGAACA TTGCAGAAGG TTTAAGAGCT CTCCTGGCTA GGAGTCACGT AGAAAGGACT 1920 

ACCGACGAAG GAACTTGGGT CGCCGGTGTG TTCGTATATG GAGGTAGTAA GACCTCCCTT 1980 

TACAACCTAA GGCGAGGAAC TGCCCTTGCT ATTCCACAAT GTCGTCTTAC ACCATTGAGT 2040 

CGTCTCCCCT TTGGAATGGC CCCTGGACCC GGCCCACAAC CTGGCCCGCT AAGGGAGTCC 2100 

ATTGTCTGTT ATTTCATGGT CTTTTTACAA ACTCATATAT TTGCTGAGGT TTTGAAGGAT 2160 

GCGATTAAGG ACCTTGTTAT GACAAAGCCC GCTCCTACCT GCAATATCAG GGTGACTGTG 2220 

TGCAGCTTTG ACGATGGAGT AGATTTGCCT CCCTGGTTTC CACCTATGGT GGAAGGGGCT 2280 

GCCGCGGAGG GTGATGACGG AGATGACGGA GATGAAGGAG GTGATGGAGA TGAGGGTGAG 2340 

GAAGGGCAGG AGTGATGTAA CTTGTTAGGA GACGCCCTCA ATCGTATTAA AAGCCGTGTA 2400 

TTCCCCCGCA CTAAAGAATA AATCCCCAGT AGACATCATG CGTGCTGTTG GTGTATTTCT 2460 

GGCCATCTGT CTTGTCACCA TTTTCGTCCT CCCAACATGG GGCAATTGGG CATACCCATG 2 520 

TTGTCACGTC ACTCAGCTCC GCGCTCAACA CCTTCTCGCG TTGGAAAACA TTAGCGACAT 2580 

TTACCTGGTG AGCAATCAGA CATGCGACGG CTTTAGCCTG GCCTCCTTAA ATTCACCTAA 2640 

GAATGGGAGC AACCAGCATG CAGGAAAAGG ACAAGCAGCG AAAATTCACG CCCCCTTGGG 2700 

AGGTGGCGGC ATATGCAAAG GATAGCACTC CCACTCTACT ACTGGGTATC ATATGCTGAC 2760 

TGTATATGCA TGAGGATAGC ATATGCTACC CGGATACAGA TTAGGATAGC ATATACTACC 2820 

CAGATATAGA TTAGGATAGC ATATGCTACC CAGATATAGA TTAGGATAGC CTATGCTACC 288 0 

CAGATATAAA TTAGGATAGC ATATACTACC CAGATATAGA TTAGGATAGC ATATGCTACC 294 0 

CAGATATAGA TTAGGATAGC CTATGCTACC CAGATATAGA TTAGGATAGC ATATGCTACC 3000 

CAGATATAGA TTAGGATAGC ATATGCTATC CAGATATTTG GGTAGTATAT GCTACCCAGA 3060 

TATAAATTAG GATAGCATAT ACTACCCTAA TCTCTATTAG GATAGCATAT GCTACCCGGA 3120 

TACAGATTAG GATAGCATAT ACTACCCAGA TATAGATTAG GATAGCATAT GCTACCCAGA 3180 

TATAGATTAG GATAGCCTAT GCTACCCAGA TATAAATTAG GATAGCATAT ACTACCCAGA 3240 

TATAGATTAG GATAGCATAT GCTACCCAGA TATAGATTAG GATAGCCTAT GCTACCCAGA 3300 

TATAGATTAG GATAGCATAT GCTATCCAGA TATTTGGGTA GTATATGCTA CCCATGGCAA 3360 

CATTAGCCCA CCGTGCTCTC AGCGACCTCG TGAATATGAG GACCAACAAC CCTGTGCTTG 3420 

GCGCTCAGGC GCAAGTGTGT GTAATTTGTC CTCCAGATCG CAGCAATCGC GCCCCTATCT 3480 

TGGCCCGCCC ACCTACTTAT GCAGGTATTC CCCGGGGTGC CATTAGTGGT TTTGTGGGCA 3540 

AGTGGTTTGA CCGCAGTGGT TAGCGGGGTT ACAATCAGCC AAGTTATTAC ACCCTTATTT 3600 

TACAGTCCAA AACCGCAGGG CGGCGTGTGG GGGCTGACGC GTGCCCCCAC TCCACAATTT 3660 

CAAAAAAAAG AGTGGCCACT TGTCTTTGTT TATGGGCCCC ATTGGCGTGG AGCCCCGTTT 3720 

AATTTTCGGG GGTGTTAGAG ACAACCAGTG GAGTCCGCTG CTGTCGGCGT CCACTCTCTT 3780 

TCCCCTTGTT ACAAATAGAG TGTAACAACA TGGTTCACCT GTCTTGGTCC CTGCCTGGGA 3840 

CACATCTTAA TAACCCCAGT ATCATATTGC ACTAGGATTA TGTGTTGCCC ATAGCCATAA 3900 

ATTCGTGTGA GATGGACATC CAGTCTTTAC GGCTTGTCCC CACCCCATGG ATTTCTATTG 3 960 

TTAAAGATAT TCAGAATGTT TCATTCCTAC ACTAGTATTT ATTGCCCAAG GGGTTTGTGA 4020 

GGGTTATATT GGTGTCATAG CACAATGCCA CCACTGAACC CCCCGTCCAA ATTTTATTCT 4080 

GGGGGCGTCA CCTGAAACCT TGTTTTCGAG CACCTCACAT ACACCTTACT GTTCACAACT 4140 

CAGCAGTTAT TCTATTAGCT AAACGAAGGA GAATGAAGAA GCAGGCGAAG ATTCAGGAGA 4200 

GTTCACTGCC CGCTCCTTGA TCTTCAGCCA CTGCCCTTGT GACTAAAATG GTTCACTACC 4260 

CTCGTGGAAT CCTGACCCCA TGTAAATAAA ACCGTGACAG CTCATGGGGT GGGAGATATC 4320 

GCTGTTCCTT AGGACCCTTT TACTAACCCT AATTCGATAG CATATGCTTC CCGTTGGGTA 4380 

ACATATGCTA TTGAATTAGG GTTAGTCTGG ATAGTATATA CTACTACCCG GGAAGCATAT 4440 

GCTACCCGTT TAGGGTTAAC AAGGGGGCCT TATAAACACT ATTGCTAATG CCCTCTTGAG 4500 

75 
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GGTCCGCTTA TCGGTAGCTA CACAGGCCCC TCTGATTGAC GTTGGTGTAG CCTCCCGTAG 
TCTTCCTGGG CCCCTGGGAG GTACATGTCC CCCAGCATTG GTGTAAGAGC TTCAGCCAAG 
AGTTACACAT AAAGGCAATG TTGTGTTGCA GTCCACAGCA TGCAAAGTCT GCTCCAGGAT 
GAAAGCCACT CAGTGTTGGC AAATGTGCAC ATCCATTTAT AAGGATGTCA ACTACAGTCA 
GAGAACCCCT TTGTGTTTGG TCCCCCCCCG TGTCACATGT GGAACAGGGC CCAGTTGGCA 
AGTTGTACCA ACCAACTGAA GGGATTACAT GCACTGCCCC GAATACAAAA CAAAAGCGCT 
CCTCGTACCA GCGAAGAAGG GGCAGAGATG CCGTAGTCAG GTTTAGTTCG TCCGGCGGCG 
GGGCGATCCG CGGCCGCCTG CGCGCTCGCT CGCTCACTGA GGCCGCCCGG GCAAAGCCCG 
GGCGTCGGGC GACCTTTGGT CGCCCGGCCT CAGTGAGCGA GCGAGCGCGC AGAGAGGGAG 
TGGCCAACTC CATCACTAGG GGTTCCTTGT AGTTAATGAT TAACCCGCCA TGCTACTTAT 
CTACGGCCGC GGAATTTCGA CTCTAGGCCA TTGCATACGT TGTATCTATA TCATAATATG 
TACATTTATA TTGGCTCATG TCCAATATGA CCGCCATGTT GACATTGATT ATTGACTAGT 
TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT 
ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG CCCATTGACG 
TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG ACGTCAATGG 
GTGGAGTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA TATGCCAAGT 
CCGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG 
ACCTTACGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 
GTGATGCGGT TTTGGCAGTA CACCAATGGG CGTGGATAGC GGTTTGACTC ACGGGGATTT 
CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA TCAACGGGAC 
TTTCCAAAAT GTCGTAATAA CCCCGCCCCG TTGACGCAAA TGGGCGGTAG GCGTGTACGG 
TGGGAGGTCT ATATAAGCAG AGCTCGTTTA GTGAACCGTC AGATCGCCTG GAGACGCCAT 
CCACGCTGTT TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 
CGGTGCATTG GAACGCGGAT TCCCCGTGCC AAGAGTGACG TAAGTACCGC CTATAGACTC 
TATAGGCACA CCCCTTTGGC TCTTATGCAT GCTATACTGT TTTTGGCTTG GGGCCTATAC 
ACCCCCGCTC CTTATGCTAT AGGTGATGGT ATAGCTTAGC CTATAGGTGT GGGTTATTGA 
CCATTATTGA CCACTCCCCT ATTGGTGACG ATACTTTCCA TTACTAATCC ATAACATGGC 
TCTTTGCCAC AACTATCTCT ATTGGCTATA TGCCAATACT CTGTCCTTCA GAGACTGACA 
CGGACTCTGT ATTTTTACAG GATGGGGTCC ATTTATTATT TACAAATTCA CATATACAAC 
AACGCCGTCC CCCGTGCCCG CAGTTTTTAT TAAACATAGC GTGGGATCTC CGACATCTCG 
GGTACGTGTT CCGGACATGG GCTCTTCTCC GGTAGCGGCG GAGCTTCCAC ATCCGAGCCC 
TGGTCCCATC CGTCCAGCGG CTCATGGTCG CTCGGCAGCT CCTTGCTCCT AACAGTGGAG 
GCCAGACTTA GGCACAGCAC AATGCCCACC ACCACCAGTG TGCCGCACAA GGCCGTGGCG 
GTAGGGTATG TGTCTGAAAA TGAGCTCGGA GATTGGGCTC GCACCTGGAC GCAGATGGAA 
GACTTAAGGC AGCGGCAGAA GAAGATGCAG GCAGCTGAGT TGTTGTATTC TGATAAGAGT 
CAGAGGTAAC TCCCGTTGCG GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC 
TCGTTGCTGC CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 
CCATGGGTCT TTTCTGCAGT CACCGTCGTC GACCTAAGAA TTCAGACTCG AGCAAGTCTA 
GAAAGCCATG GATATCGGAT CCACTACGCG TTAGAGCTCG CTGATCAGCC TCGACTGTGC 
CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
ACAATAGCAG GGGGGTGGGC GAAGAACTCC AGCATGAGAT CCCCGCGCTG GAGGATCATC 
CAGCCGGCGT CCCGGAAAAC GATTCCGAAG CCCAACCTTT CATAGAAGGC GGCGGTGGAA 
TCGAAATCTC GTGATGGCAG GTTGGGCGTC GCTTGGTCGG TCATTTCGGT AGATAAGTAG 
CATGGCGGGT TAATCATTAA CTACAAGGAA CCCCTAGTGA TGGAGTTGGC CACTCCCTCT 
CTGCGCGCTC GCTCGCTCAC TGAGGCCGGG CGACCAAAGG TCGCCCGACG CCCGGGCTTT 
GCCCGGGCGG CCTCAGTGAG CGAGC6AGCG CGCAGCGAAC CCCAGAGTCC CGCTCA6AAG 
AACTCGTCAA GAAGGCGATA GAAGGCGATG CGCTGCGAAT CGGGAGCGGC GATACCGTAA 
AGCACGAGGA AGCGGTCAGC CCATTCGCCG CCAAGCTCTT CAGCAATATC ACGGGTAGCC 
AACGCTATGT CCTGATAGCG GTCCGCCACA CCCAGCCGGC CACAGTCGAT GAATCCAGAA 
AAGCGGCCAT TTTCCACCAT GATATTCGGC AAGCAGGCAT CGCCATGGGT CACGACGAGA 
TCCTCGCCGT CGGGCATGCG CGCCTTGAGC CTGGCGAACA GTTCGGCTGG CGCGAGCCCC 
TGATGCTCTT CGTCCAGATC ATCCTGATCG ACAAGACCGG CTTCCATCCG AGTACGTGCT 
CGCTCGATGC GATGTTTCGC TTGGTGGTCG AATGGGCAGG TAGCCGGATC AAGCGTATGC 
AGCCGCCGCA TTGCATCAGC CATGATGGAT ACTTTCTCGG CAGGAGCAAG GTGAGATGAC 
AGGAGATCCT GCCCCGGCAC TTCGCCCAAT AGCAGCCAGT CCCTTCCCGC TTCAGTGACA 
ACGTCGAGCA CAGCTGCGCA AGGAACGCCC GTCGTGGCCA GCCACGATAG CCGCGCTGCC 
TCGTCCTGCA GTTCATTCAG GGCACCGGAC AGGTCGGTCT TGACAAAAAG AACCGGGCGC 
CCCTGCGCTG ACAGCCGGAA CACGGCGGCA TCAGAGCAGC CGATTGTCTG TTGTGCCCAG 
TCATAGCCGA ATAGCCTCTC CACCCAAGCG GCCGGAGAAC CTGCGTGCAA TCCATCTTGT 
TCAATCATGC GAAACGATCC TCATCCTGTC TCTTGATCAG ATCTTGATCC CCTGCGCCAT 
CAGATCCTTG GCGGCAAGAA AGCCATCCAG TTTACTTTGC AGGGCTTCCC AACCTTACCA 
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GAGGGCGCCC 
TATCGCCATG 
GTCCAGATAG 
TTCTACGTGT 
GCTGTCAATT 
ATCGGCAAAA 
GTTTGGAACA 
GTCTATCAGG 
GAGAAAATAC 
CGTTCGGCTG 
ATCAGGGGAT 
TAAAAAGGCC 
AAATCGACGC 
TCCCCCTGGA 
GTCCGCCTTT 
CAGTTCGGTG 
CGACCGCTGC 
ATCGCCACTG 
TACAGAGTTC 
CTGCGCTCTG 
ACAAACCACC 
AAAGGATCTC 



CAGCTGGCAA 
TAAGCCCACT 
CCCAGTAGCT 
TCCGCTTCCT 
CCGCGTTAAA 
TCCCTTATAA 
AGAGTCCACT 
GCGATGGCGG 
CGCATCAGGC 
CGGCGAGCGG 
AACGCAGGAA 
GCGTTGCTGG 
TCAAGTCAGA 
AGCTCCCTCG 
CTCCCTTCGG 
TAGGTCGTTC 
GCCTTATCCG 
GCAGCAGCCA 
TTGAAGTGGT 
CTGAAGCCAG 
GCTGGTAGCG 
AAGAAGATCC 



TTCCGGTTCG 
GCAAGCTACC 
GACATTCATC 
TTAGCAGCCC 
TTTTTGTTAA 
ATCAAAAGAA 
ATTAAAGAAC 
ATCAGCTTAT 
GCTCTTCCGC 
TATCAGCTCA 
AGAACATGTG 
CGTTTTTCCA 
GGTGGCGAAA 
TGCGCTCTCC 
GAAGCGTGGC 
GCTCCAAGCT 
GTAACTATCG 
CTGGTAACAG 
GGCCTAACTA 
TTACCTTCGG 
GCGGTTTTTT 
TTTGATCTTT 



CTTGCTGTCC 
TGCTTTCTCT 
CGGGGTCAGC 
TTGCGCCCTG 
ATCAGCTCAT 
TAGCCCGAGA 
GTGGACTCCA 
GCGGTGTGAA 
TTCCTCGCTC 
CTCAAAGGCG 
AGCAAAAGGC 
TAGGCTCCGC 
CCCGACAGGA 
TGTTCCGACC 
GCTTTCTCAT 
GGGCTGTGTG 
TCTTGAGTCC 
GATTAGCAGA 
CGGCTACACT 
AAAAAGAGTT 
GTTTGCAAGC 
TCTTACTGAA 



ATAAAACCGC 
TTGCGCTTGC 
ACCGTTTCTG 
AGTGCTTGCG 
TTTTTAACCA 
TAGGGTTGAG 
ACGTCAAAGG 
ATACCGCACA 
ACTGACTCGC 
GTAATACGGT 
CAGCAAAAGG 
CCCCCTGACG 
CTATAAAGAT 
CTGCCGCTTA 
AGCTCACGCT 
CACGAACCCC 
AACCCGGTAA 
GCGAGGTATG 
AGAAGGACAG 
GGTAGCTCTT 
AGCAGATTAC 
CGGTGATCCC 



CCAGTCTAGC 
GTTTTCCCTT 
CGGACTGGCT 
GCAGCGTGAA 
ATAGGCCGAA 
TGTTGTTCCA 
GCGAAAAACC 
GATGCGTAAG 
TGCGCTCGGT 
TATCCACAGA 
CCAGGAACCG 
AGCATCACAA 
ACCAGGCGTT 
CCGGATACCT 
GTAGGTATCT 
CCGTTCAGCC 
GACACGACTT 
TAGGCGGTGC 
TATTTGGTAT 
GATCCGGCAA 
GCGCAGAAAA 
CACCGGAATT 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GfcCGCGGAAT TTCGACTCTA GGCCATTGCA TACGTTGTAT CTATATCATA 
TTATATTGGC TCATGTCCAA TATGACCGCC ATGTTGACAT TGATTATTGA 
ATAGTAATCA ATTACGGGGT CATTAGTTCA TAGCCCATAT ATGGAGTTCC 
ACTTACGGTA AATGGCCCGC CTGGCTGACC GCCCAACGAC CCCCGCCCAT 
AATGACGTAT GTTCCCATAG TAACGCCAAT AGGGACTTTC CATTGACGTC 
GTATTTACGG TAAACTGCCC ACTTGGCAGT ACATCAAGTG TATCATATGC 
CCCTATTGAC GTCAATGACG GTAAATGGCC CGCCTGGCAT TATGCCCAGT 
ACGGGACTTT CCTACTTGGC AGTACATCTA CGTATTAGTC ATCGCTATTA 
GCGGTTTTGG CAGTACACCA ATGGGCGTGG ATAGCGGTTT GACTCACGGG 
TCTCCACCCC ATTGACGTCA ATGGGAGTTT GTTTTGGCAC CAAAATCAAC 
AAAATGTCGT AATAACCCCG CCCCGTTGAC GCAAATGGGC GGTAGGCGTG 
GGTCTATATA AGCAGAGCTC GTTTAGTGAA CCGTCAGATC GCCTGGAGAC 
CTGTTTTGAC CTCCATAGAA GACACCGGGA CCGATCCAGC CTCCGCGGCC 
CATTGGAACG CGGATTCCCC GTGCCAAGAG TGACGTAAGT ACCGCCTATA 
GCACACCCCT TTGGCTCTTA TGCATGCTAT ACTGTTTTTG GCTTGGGGCC 
CGCTCCTTAT GCTATAGGTG ATGGTATAGC TTAGCCTATA GGTGTGGGTT 
ATTGACCACT CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC 
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 
TCTGTATTTT TACAGGATGG GGTCCATTTA TTATTTACAA ATTCACATAT 
CGTCCCCCGT GCCCGCAGTT TTTATTAAAC ATAGCGTGGG ATCTCCGACA 
GTGTTCCGGA CATGGGCTCT TCTCCGGTAG CGGCGGAGCT TCCACATCCG 
CCATCCGTCC AGCGGCTCAT GGTCGCTCGG CAGCTCCTTG CTCCTAACAG 
ACTTAGGCAC AGCACAATGC CCACCACCAC CAGT6TGCCG CACAAGGCCG 
GTATGTGTCT GAAAATGA6C TCGGAGATTG GGCTCGCACC TGGACGCAGA 
AAGGCAGCGG CAGAAGAAGA TGCAGGCAGC TGAGTTGTTG TATTCTGATA 
GTAACTCCCG TTGCGGTGCT GTTAACGGTG GAGGGCAGTG TAGTCTGAGC 
GCTGCCGCGC GCGCCACCAG ACATAATAGC TGACAGACTA ACAGACTGTT 
GGTCTTTTCT GCAGTCACCG TCGTCGACCT AAGAATTCAG ACTCGAGCAA 
CCATGGATAT CGGATCCACT ACGCGTTAGA GCTCGCTGAT CAGCCTCGAC 
AGTTGCCAGC CATCTGTTGT TTGCCCCTCC CCCGTGCCTT CCTTGACCCT 
ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CGCATTGTCT 



ATATGTACAT 
CTAGTTATTA 
GCGTTACATA 
TGACGTCAAT 
AATGGGTGGA 
CAAGTCCGCC 
ACATGACCTT 
CCATGGTGAT 
GATTTCCAAG 
GGGACTTTCC 
TACGGTGGGA 
GCCATCCACG 
GGGAACGGTG 
GACTCTATAG 
TATACACCCC 
ATTGACCATT 
ATGGCTCTTT 
TGACACGGAC 
ACAACAACGC 
TCTCGGGTAC 
AGCCCTGGTC 
TGGAGGCCAG 
TGGCGGTAGG 
TGGAAGACTT 
AGAGTCAGAG 
AGTACTCGTT 
CCTTTCCATG 
GTCTAGAAAG 
TGTGCCTTCT 
GGAAGGTGCC 
GAGTAGGTGT 
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CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG GGGAGGATTG GGAAGACAAT 
AGCAGGGGGG TGGGCGAAGA ACTCCAGCAT GAGATCCCCG CGCTGGAGGA TCATCCAGCC 
GGCGTCCCGG AAAACGATTC CGAAGCCCAA CCTTTCATAG AAGGCGGCGG TGGAATCGAA 
ATCTCGTGAT GGCAGGTTGG GCGTCGCTTG GTCGGTCATT TCGAACCCCA GAGTCCCGCT 
CAGAAGAACT CGTCAAGAAG GCGATAGAAG GCGATGCGCT GCGAATCGGG AGCGGCGATA 
CCGTAAAGCA CGAGGAAGCG GTCAGCCCAT TCGCCGCCAA GCTCTTCAGC AATATCACGG 
GTAGCCAACG CTATGTCCTG ATAGCGGTCC GCCACACCCA GCCGGCCACA GTCGATGAAT 
CCAGAAAAGC GGCCATTTTC CACCATGATA TTCGGCAAGC AGGCATCGCC ATGGGTCACG 
ACGAGATCCT CGCCGTCGGG CATGCGCGCC TTGAGCCTGG CGAACAGTTC GGCTGGCGCG 
AGCCCCTGAT GCTCTTCGTC CAGATCATCC TGATCGACAA GACCGGCTTC CATCCGAGTA 
CGTGCTCGCT CGATGCGATG TTTCGCTTGG TGGTCGAATG GGCAGGTAGC CGGATCAAGC 
GTATGCAGCC GCCGCATTGC ATCAGCCATG ATGGATACTT TCTCGGCAGG AGCAAGGTGA 
GATGACAGGA GATCCTGCCC CGGCACTTCG CCCAATAGCA GCCAGTCCCT TCCCGCTTCA 
GTGACAACGT CGAGCACAGC TGCGCAAGGA ACGCCCGTCG TGGCCAGCCA CGATAGCCGC 
GCTGCCTCGT CCTGCAGTTC ATTCAGGGCA CCGGACAGGT CGGTCTTGAC A?Uy!LAGAACC 
GGGCGCCCCT GCGCTGACAG CCGGAACACG GCGGCATCAG AGCAGCCGAT TGTCTGTTGT 
GCCCAGTCAT AGCCGAATAG CCTCTCCACC CAAGCGGCCG GAGAACCTGC GTGCAATCCA 
TCTTGTTCAA TCATGCGAAA CGATCCTCAT CCTGTCTCTT GATCAGATCT TGATCCCCTG 
CGCCATCAGA TCCTTGGCGG CAAGAAAGCC ATCCAGTTTA CTTTGCAGGG CTTCCCAACC 
TTACCAGAGG GCGCCCCAGC TGGCAATTCC GGTTCGCTTG CTGTCCATAA AACCGCCCAG 
TCTAGCTATC GCCATGTAAG CCCACTGCAA GCTACCTGCT TTCTCTTTGC GCTTGCGTTT 
TCCCTTGTCC AGATAGCCCA GTAGCTGACA TTCATCCGGG GTCAGCACCG TTTCTGCGGA 
CTGGCTTTCT ACGTGTTCCG CTTCCTTTAG CAGCCCTTGC GCCCTGAGTG CTTGCGGCAG 
CGTGAAGCTG TCAATTCCGC GTTAAATTTT TGTTAAATCA GCTCATTTTT TAACCAATAG 
GCCGAAATCG GCAAAATCCC TTATAAATCA AAAGAATAGC CCGAGATAGG GTTGAGTGTT 
GTTCCAGTTT GGAACAAGAG TCCACTATTA AAGAACGTGG ACTCCAACGT CAAAGGGCGA 
AAAACCGTCT ATCAGGGCGA TGGCGGATCA GCTTATGCGG TGTGAAATAC CGCACAGATG 
CGTAAGGAGA AAATACCGCA TCAGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG 
CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC 
CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 
GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA 
TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA 
GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG 
ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG 
GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT 
TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA 
CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG 
CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAQAA GGACAGTATT 
TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC 
CGGCAAACAA ACCACCGCTG GTAGCGGCGG I TTT T T GTTT GCAAGCAGCA GATTACGCGC 
AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CTGAACGGTG ATCCCCACCG 
GAATTGCG 

( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 i near 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCGGCCGCGG AATTTCGACT CTAGGCCATT GCATACGTTG TATCTATATC ATAATATGTA 
CATTTATATT GGCTCATGTC CAATATGACC GCCATGTTGA CATTGATTAT TGACTAGTTA 
TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCCGCGTTAC 
ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC CATTGACGTC 
AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATGGGT 
GGAGTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTCC 
GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 
CTTACGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TTACCATGGT 
GATGCGGTTT TGGCAGTACA CCAATGGGCG TGGATAGCGG TTTGACTCAC GGGGATTTCC 
AAGTCTCCAC CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC AACGGGACTT 
TCCAAAATGT CGTAATAACC CCGCCCCGTT GACGCAAATG GGCGGTAGGC GTGTACGGTG 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 

3 960 ^, 
4020 

4080 I 

4140 

4200.. 

4260?/ 

4320 

4328 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 




BNSDOCID: <WO 9a06437A2_l_> 



wo 98/06437 



PCT/US97/14465 



GGAGGTCTAT 

ACGCTGTTTT 

GTGCATTGGA 

TAGGCACACC 

CCCCGCTCCT 

ATTATTGACC 

TTTGCCACAA 

GACTCTGTAT 

CGCCGTCCCC 

TACGTGTTCC 

GTCCCATCCG 

CAGACTTAGG 

AGGGTATGTG 

CTTAAGGCAG 

GAGGTAACTC 

GTTGCTGCCG 

ATGGGTCTTT 

TCGCTGAGGG 

GGCTTCTCCT 

GCCTCATCTG 

ATGTCACGAT 

CCAAAGTTAA 

GGCAGGGCCT 

CTTCCCAGCC 

GCATCACCAC 

CCTCGGCTGC 

ACTCCAATTT 

ACAGATGACC 

TGCCACACCC 

CTGTCCCATG 

AGCACAACTC 

TAGTTGCCAG 

CACTCCCACT 

TCATTCTATT 

TAGCAGGGGG 

CGGCGTCCCG 

AATCTCGTGA 

TCAGAAGAAC 

ACCGTAAAGC 

GGTAGCCAAC 

TCCAGAAAAG 

GACGAGATCC 

GAGCCCCTGA 

ACGTGCTCGC 

CGTATGCAGC 

AGATGACAGG 

AGTGACAACG 

CGCTGCCTCG 

CGGGCGCCCC 

TGCCCAGTCA 

ATCTTGTTCA 

GCGCCATCAG 

CTTACCAGAG 

GTCTAGCTAT 

TTCCCTTGTC 

ACTGGCTTTC 

GCGTGAAGCT 

GGCCGAAATC 

TGTTCCAGTT 

AAAAACCGTC 

GCGTAAGGAG 

GCTCGGTCGT 

CCACAGAATC 



ATAAGCAGAG 
GACCTCCATA 
ACGCGGATTC 
CCTTTGGCTC 
TATGCTATAG 
ACTCCCCTAT 
CTATCTCTAT 
TTTTACAGGA 
CGTGCCCGCA 
GGACATGGGC 
TCCAGCGGCT 
CACAGCACAA 
TCTGAAAATG 
CGGCAGAAGA 
CCGTTGCGGT 
CGCGCGCCAC 
TCTGCAGTCA 
ACCCCGGCCA 
GTCTCTCGTG 
TGACAGCCGA 
GGGCTGTTCC 
CTTCTATGCC 
GGCCCTGCTC 
TTTCGAGCCC 
TCTGCTTOGG 
TCCACTCCGA 
CCTCCGGGGA 
AGGTGCGTCC 
TCCCTCACCA 
GACACTCCAG 
TGAGATCCAC 
CCATCTGTTG 
GTCCTTTCCT 
CTGGGGGGTG 
GTGGGCGAAG 
GAAAACGATT 
TGGCAGGTTG 

tcgtcaagaa 
acgAggaagc 

GCTATGTCCT 
CGGCCATTTT 
TCGCCGTCGG 
TGCTCTTCGT 
TCGATGCGAT 
CGCCGCATTG 
AGATCCTGCC 
TCGAGCACAG 
TCCTGCAGTT 
TGCGCTGACA 
TAGCCGAATA 
ATCATGCGAA 
ATCCTTGGCG 
GGCGCCCCAG 
CGCCAT6TAA 
CAGATAGCCC 
TACGTGTTCC 
GTCAATTCCG 
GGCAAAATCC 
TGGAACAAGA 
TATCAGGGCG 
AAAATACCGC 
TC6GCTGCGG 
AGGGGATAAC 



CTCGTTTAGT 
GAAGACACCG 
CCCGTGCCAA 
TTATGCATGC 
GTGATGGTAT 
TGGTGACGAT 
TGGCTATATG 
TGGGGTCCAT 
GTTTTTATTA 
TCTTCTCCGG 
CATGGTCGCT 
TGCCCACCAC 
AGCTCGGAGA 
AGATGCAGGC 
GCTGTTAACG 
CAGACATAAT 
CCGTCGTCGA 
GGCGCGGAGA 
TCGCTCCCTC 
GTCCTGGAGA 
GAAAGCTGCA 
TGGAAGAGGA 
TCAGAAGCTG 
CTGCAGCTGC 
GCGCTGGGAG 
ACCATCACTG 
AAGCTGAAGC 
AGCTGGGCAC 
CTCCCGAACC 
TGCCAGCAAT 
TACGCGTTAG 
TTTGCCCCTC 
AATAAAATGA 
GGGTGGGGCA 
AACTCCAGCA 
CCGAAGCCCA 
GGCGTCGCTT 
GGCGATAGAA 
GGTCAGCCCA 
GATAGCGGTC 
CCACCATGAT 
GCATGCGCGC 
CCAGATCATC 
GTTTCGCTTG 
CATCAGCCAT 
CCGGCACTTC 
CTGCGCAAGG 
CATTCAGGGC 
GCCGGAACAC 
GCCTCTCCAC 
ACGATCCTCA 
GCAAGAAAGC 
CTGGCAATTC 
GCCCACTGCA 
AGTAGCTGAC 
GCTTCCTTTA 
CGTTAAATTT 
CTTATAAATC 
GTCCACTATT 
ATGGCGGATC 
ATCAGGCGCT 
CGAGCGGTAT 
GCAGGAAAGA 



GAACCGTCAG 
GGACCGATCC 
GAGTGACGTA 
TATACTGTTT 
AGCTTAGCCT 
ACTTTCCATT 
CCAATACTCT 
TTATTATTTA 
AACATAGCGT 
TAGCGGCGGA 
CGGCAGCTCC 
CACCAGTGTG 
TTGGGCTCGC 
AGCTGAGTTG 
GTGGAGGGCA 
AGCTGACAGA 
CCTAAGAATT 
TGGGGGTGCA 
TGGGCCTCCC 
GGTACCTCTT 
GCTTGAATGA 
TGGAGGTCGG 
TCCTGCGGGG 
ACATGGATAA 
CCCAG6AAGC 
CTGACACTTT 
TGTACACGGG 
ATCCACCACC 
CCATCGAGGG 
GACATCTCAG 
AGCTCGCTGA 
CCCCGTGCCT 
GGAAATTGCA 
GGACAGCAAG 
TGAGATCCCC 
ACCTTTCATA 
GGTCGGTCAT 
GGCGATGCGC 
TTCGCCGCCA 
CGCCACACCC 
ATTCGGC7LAG 
CTTGAGCCTG 
CTGATCGACA 
GTGGTCGAAT 
GATGGATACT 
GCCCAATAGC 
AACGCCCGTC 
ACCGGACAGG 
GGCGGCATCA 
CCAAGCGGCC 
TCCTGTCTCT 
CATCCAGTTT 
CGGTTCGCTT 
AGCTACCTGC 
ATTCATCCGG 
GCAGCCCTTG 
TTGTTAAATC 
AAAAGAATAG 
AAA6AACGTG 
AGCTTATGCG 
CTTCCGCTTC 
CAGCTCACTC 
ACATGTGAGC 



ATCGCCTGGA 

AGCCTCCGCG 

AGTACCGCCT 

TTGGCTTGGG 

ATAGGTGTGG 

ACTAATCCAT 

GTCCTTCAGA 

CAAATTCACA 

GGGATCTCCG 

GCTTCCACAT 

TTGCTCCTAA 

CCGCACAAGG 

ACCTGGACGC 

TTGTATTCTG 

GTGTAGTCTG 

CTAACAGACT 

CAGACTCGAG 

CGAATGTCCT 

AGTCCCGGGC 

GGAGGCCAAG 

GAATATCACC 

GCAGCAGGCT 

CCAGGCCGTG 

AGCCATCAGT 

CATCTCCCTC 

CTGCAAACTC 

GGAGGCCTGC 

TCCCTCACCA 

GCTCTCAGCT 

GGGCCAGAGG 

TCAGCCTCGA 

TCCTTGACCC 

TCGCATTGTC 

GGGGAGGATT 

GCGCTGGAGG 

GAAGGCGGCG 

TTCGAACCCC 

TGCGAATCGG 

AGCTCTTCAG 

AGCCGGCCAC 

CAGGCATCGC 

GCGAACAGTT 

AGACCGGCTT 

GGGCAGGTAG 

TTCTCGGCAG 

AGCCAGTCCC 

GTGGCCAGCC 

TCGGTCTTGA 

GAGCAGCCGA 

GGAGAACCTG 

TGATCAGATC 

ACTTTGCAGG 

GCTGTCCATA 

TTTCTCTTTG 

GGTCAGCACC 

CGCCCTGAGT 

AGCTCATTTT 

CCCGAGATAG 

GACTCCAACG 

GTGTGAAATA 

CTCGCTCACT 

AAAGGCGGTA 

AAAAGGCCAG 



GACGCCATCC 720 

GCCGGGAACG 780 

ATAGACTCTA 840 

GCCTATACAC 900 

GTTATTGACC 960 

AACATGGCTC 1020 

GACTGACACG 1080 

TATACAACAA 1140 

ACATCTCGGG 1200 

CCGAGCCCTG 1260 

CAGTGGAGGC 13 20 

CCGTGGCGGT 13 80 

AGATGGAAGA 1440 

ATAAGAGTCA 1500 

AGCAGTACTC 1560 

GTTCCTTTCC 1620 

CAAGTCTAGG 1680 

GCCTGGCTGT 1740 

GCCCCACCAC 1800 

GAGGCCGAGA 1860 

GTCCCAGACA 192 0 

GTAGAAGTCT 1980 

TTGGCCAACT 2040 

GGCCTTCGCA 2100 

CCAGATGCGG 2160 

TTCCGAGTCT 2220 

AGG AGAGGGG 2280 

AC ACTGCCTG 2340 

AAGCGCCAGC 2400 

AACTGTCCAG 2460 

CTGTGCCTTC 2 520 

TGGAAGGTGC 2580 

TGAGTAGGTG 2640 

GGGAAGACAA 2700 

ATCATCC AGC 2760 

GTGGAATCGA 2820 

AGAGTCCCGC 2880 

GAGCGGCG AT 2940 

C AAT ATC ACG 3000 

AGTCGATGAA 3060 

CATGGGTCAC 3120 

CGGCTGGCGC 3180 

CCATCCGAGT 3240 

CCGGATCAAG 3300 

GAGCAAGGTG 3360 

TTCCCGCTTC 3420 

ACGAT AGCCG 3480 

CAAAAAGAAC 3540 

TTGTCTGTTG 3600 

CGTGCAATCC 3660 

TTGATCCCCT 3720 

GCTTCCCAAC 3780 

AAACCGCCCA 3840 

CGCTTGCGTT 3900 

GTTTCTGCGG 3960 

GCTTGCGGC A 4020 

TTAACCAATA 4080 

GGTTGAGTGT 4140 

TCAAAGGGCG 4200 

CCGCACAGAT 4260 

GACTCGCTGC 4320 

ATACGGTTAT 4380 

CAAAAGGCCA 4440 



BNSDOCID: <WO 9806437A2_L> 



wo 98/06437 



GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 
ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC 
AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 
GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCATAGC TCACGCTGTA 
GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 
TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 
ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG 
GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 
TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 
CCGGCAAACA AACCACCGCT GGTAGCGGCG GTTTTTTGTT TGCAAGCAGC AGATTACGCG 
CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT TACTGAACGG TGATCCCCAC 
CGGAATT 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGGCCGCGG AATTTCGACT CTAGGCCATT GCATACGTTG TATCTATATC ATAATATGTA 
CATTTATATT GGCTCATGTC CAATATGACC GCCATGTTGA CATTGATTAT TGACTAGTTA 
TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCCGCGTTAC 
ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC CATTGACGTC 
AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATGGGT 
GGAGTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTCC 
GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 
CTTACGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TTACCATGGT 
GATGCGGTTT TGGCAGTACA CCAATGGGCG TGGATAGCGG TTTGACTCAC GGGGATTTCC 
AAGTCTCCAC CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC AACGGGACTT 
TCCAAAATGT CGTAATAACC CCGCCCCGTT GACGCAAATG GGCGGTAGGC GTGTACGGTG 
GGAGGTCTAT ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC 
ACGCTGTTTT GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG GCCGGGAACG 
GTGCATTGGA ACGCGGATTC CCCGTGCCAA GAGTGACGTA AGTACCGCCT ATAGACTCTA 
TAGGCACACC CCTTTGGCTC TTATGCATGC TATACTGTTT TTGGCTTGGG GCCTATACAC 
CCCCGCTCCT TATGCTATAG GTGATGGTAT AGCTTAGCCT ATAGGTGTGG GTTATTGACC 
ATTATTGACC ACTCCCCTAT TGGTGACGAT ACTTTCCATT ACTAATCCAT AACATGGCTC 
TTTGCCACAA CTATCTCTAT TGGCTATATG CCAATACTCT GTCCTTCAGA GACTGACACG 
GACTCTGTAT TTTTACAGGA TGGGGTCCAT TTATTATTTA CAAATTCACA TATACAACAA 
CGCCGTCCCC CGTGCCCGCA GTTTTTATTA AACATAGCGT GGGATCTCCG ACATCTCGGG 
TACGTGTTCC GGACATGGGC TCTTCTCCGG TAGCGGCGGA GCTTCCACAT CCGAGCCCTG 
GTCCCATCCG TCCAGCGGCT CATGGTCGCT CGGCAGCTCC TTGCTCCTAA CAGTGGAGGC 
CAGACTTAGG CACAGCACAA TGCCCACCAC CACCAGTGTG CCGCACAAGG CCGTGGCGGT 
AGGGTATGTG TCTGAAAATG AGCTCGGAGA TTGGGCTCGC ACCTGGACGC AGATGGAAGA 
CTTAAGGCAG CGGCAGAAGA AGATGCAGGC AGCTGAGTTG TTGTATTCTG ATAAGAGTCA 
GAGGTAACTC CCGTTGCGGT GCTGTTAACG GTGGAGGGCA GTGTAGTCTG AGCAGTACTC 
GTTGCTGCCG CGCGCGCCAC CAGACATAAT AGCTGACAGA CTAACAGACT GTTCCTTTCC 
ATGGGTCTTT TCTGCAGTCA CCGTCGTCGA CCTAAGAATT CAGACTCGAG CAAGTCTAGA 
ATGTGCTGGA GACCCCTGTG TCGGTTCCTG TGGCTTTGGT CCTATCTGTC TTATGTTCAA 
GCAGTGCCTA TCCAGAAAGT CCAGGATGAC ACCAAAACCC TCATCAAGAC CATTGTCACC 
AGGATCAATG ACATTTCACA CACGCAGTCG GTATCCGCCA AGCAGAGGGT CACTGGCTTG 
GACTTCATTC CTGGGCTTCA CCCCATTCTG AGTTTGTCCA AGATGGACCA GACTCTGGCA 
GTCTATCAAC AGGTCCTCAC CAGCCTGCCT TCCCAAAATG TGCTGCAGAT AGCCAATGAC 
CTGGAGAATC TCCGAGACCT CCTCCATCTG CTGGCCTTCT CCAAGAGCTG CTCCCTGCCT 
CAGACCAGTG GCCTGCAGAA GCCAGAGAGC CTGGATGGCG TCCTGGAAGC CTCACTCTAC 
TCCACAGAGG TGGTGGCTTT GAGCAGGCTG CAGGGCTCTC TGCAGGACAT TCTTCAACAG 
TTGGATGTTA GCCCTGAATG TGAGGATCCA CTACGCGTTA GAGCTCGCTG ATCAGCCTCG 
ACTGTGCCTT CTAGTTGCCA GCCATCTGTT GTTTGCCCCT CCCCCGTGCC TTCCTTGACC 
CTGGAAGGTG CCACTCCCAC TGTCCTTTCC TAATAAAATG AGGAAATTGC ATCGCATTGT 
CTGAGTAGGT GTCATTCTAT TCTGGGGGGT GGGGTGGGGC AGGACAGCAA GGGGGAGGAT 
TGGGAAGACA ATAGCAGGGG GGTGGGCGAA GAACTCCAGC ATGAGATCCC CGCGCTGGAG 



4500 
4560 
4620 
46B0 
4740 
4800 
4B60 
4920 
4980 
5040 
5100 
5107 
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GATCATCCAG CCGGCGTCCC GGAAAACGAT TCCGAAGCCC AACCTTTCAT AGAAGGCGGC 
GGTGGAATCG AAATCTCGTG ATGGCAGGTT GGGCGTCGCT TGGTCGGTCA TTTCGAACCC 
CAGAGTCCCG CTCAGAAGAA CTCGTCAAGA AGGCGATAGA AGGCGATGCG CTGCGAATCG 
GGAGCGGCGA TACCGTAAAG CACGAGGAAG CGGTCAGCCC ATTCGCCGCC AAGCTCTTCA 
GCAATATCAC GGGTAGCCAA CGCTATGTCC TGATAGCGGT CCGCCACACC CAGCCGGCCA 
CAGTCGATGA ATCCAGAAAA GCGGCCATTT TCCACCATGA TATTCGGCAA GCAGGCATCG 
CCATGGGTCA CGACGAGATC CTCGCCGTCG GGCATGCGCG CCTTGAGCCT GGCGAACAGT 
TCGGCTGGCG CGAGCCCCTG ATGCTCTTCG TCCAGATCAT CCTGATCGAC AAGACCGGCT 
TCCATCCGAG TACGTGCTCG CTCGATGCGA TGTTTCGCTT GGTGGTCGAA TGGGCAGGTA 
GCCGGATCAA GCGTATGCAG CCGCCGCATT GCATCAGCCA TGATGGATAC TTTCTCGGCA 
GGAGCAAGGT GAGATGACAG GAGATCCTGC CCCGGCACTT CGCCCAATAG CAGCCAGTCC 
CTTCCCGCTT CAGTGACAAC GTCGAGCACA GCTGCGCAAG GAACGCCCGT CGTGGCCAGC 
CACGATAGCC GCGCTGCCTC GTCCTGCAGT TCATTCAGGG CACCGGACAG GTCGGTCTTG 
ACAAAAAGAA CCGGGCGCCC CTGCGCTGAC AGCCGGAACA CGGCGGCATC AGAGCAGCCG 
ATTGTCTGTT GTGCCCAGTC ATAGCCGAAT AGCCTCTCCA CCCAAGCGGC CGGAGAACCT 
GCGTGCAATC CATCTTGTTC AATCATGCGA AACGATCCTC ATCCTGTCTC TTGATCAGAT 
CTTGATCCCC TGCGCCATCA GATCCTTGGC GGCAAGAAAG CCATCCAGTT TACTTTGCAG 
GGCTTCCCAA CCTTACCAGA GGGCGCCCCA GCTGGCAATT CCGGTTCGCT TGCTGTCCAT 
AAAACCGCCC AGTCTAGCTA TCGCCATGTA AGCCCACTGC AAGCTACCTG CTTTCTCTTT 
GCGCTTGCGT TTTCCCTTGT CCAGATAGCC CAGTAGCTGA CATTCATCCG GGGTCAGCAC 
CGTTTCTGCG GACTGGCTTT CTACGTGTTC CGCTTCCTTT AGCAGCCCTT GCGCCCTGAG 
TGCTTGCGGC AGCGTGAAGC TGTCAATTCC GCGTTAAATT TTTGTTAAAT CAGCTCATTT 
TTTAACCAAT AGGCCGAAAT CGGCAAAATC CCTTATAAAT CAAAAGAATA GCCCGAGATA 
GGGTTGAGTG TTGTTCCAGT TTGGAACAAG AGTCCACTAT TAAAGAACGT GGACTCCAAC 
GTCAAAGGGC GAAAAACCGT CTATCAGGGC GATGGCGGAT CAGCTTATGC GGTGTGAAAT 
ACCGCACAGA TGCGTAAGGA GAAAATACCG CATCAGGCGC TCTTCCGCTT CCTCGCTCAC 
TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA TCAGCTCACT CAAAGGCGGT 
AATACGGTTA TCCACAGAAT CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA 
GCAAAAGGCC AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 
CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT 
ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT 
GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC TTTCTCATAG 
CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA 
CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA 
CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 
GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG GCTACACTAG 
AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG 
TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGC GGTTTTTTGT TTGCAAGCAG 
CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TTACTGAACG 
GTGATCCCCA CCGGAATT 
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WHAT IS CLAIMED: 

1. A polycationic agent having the following formula: 

Ri Ro O 
I T II 

Ta I — N— C— C — I Tc 



R. 



n 



wherein 

n is an integer from 10 to 100; 
Rp Rj, and R, for each monomer, 

Ri Ro O 
I T' II 

— N— C— C — 

R3 

are independently selected from moieties having a molecular weight from 1 to 200 
daltons; 

Ta and Tc are terminating groups; 

R, is not hydrogen for at least one monomer; 

wherein said polycationic agent comprises at least 25% positively charged 
monomerSt excluding the terminal groups, and 

wherein said polycationic agent exhibits a net positive electrical charge at a 
physiological pH. 



2. The polycationic agent according to claim 1 , wherein said polycationic 
agent comprises repeating trimers. 



3. The polycationic agent according to claim 2, wherein two R, groups in 
each trimer are neutral moieties and one R, group in each trimer is a cationic moiety. 

4. The polycationic agent according to claim 1, wherein R, is selected from 
the group consisting of positively charged, negatively charged and neutral moieties. 

5. The polycationic agent according to claim 1, wherein R, is selected from 
substituents found on amino acids. 

S6 
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6. The polycationic agent of claim K wherein R, is selected from the group 
consisting of aromatic and aliphatic groups. 

7. The polycationic agent according to claim 1, wherein at least one R, is 
selected from the group consisting of alkylammonium, aminoalkyl, guanidinoalkyl, 
amidinoalkyl, aminocyclohexyl, piperidyi, guanidinobenzyl, amidinobenzyl, 
pyridylmethyl, aminobenzyl, alkyphenyl, indolylalkyl, alkoxyphenylalkyl, 
halophenyialkyU and hydroxyphenylalkyl. 

8. The polycationic agent according to claim 3, wherein said cationic 
moeity is aminoethyL 

9. The polycationic agent according to claim 8, wherein said neutral 
moieties are selected from the group consisting of phenethyl, benzyl, phenylpropyl, 
(R) alpha*methylbenzyU (S) alpha-methylbenzyl, methoxyphenethyl. and 
chlorophenethyl. 

10. The polycationic agent of claim 1, wherein R, and R3 are both hydrogen 
for at least one monomer. 

11. The polycationic agent of claim 10 wherein n is 36. 

12. The polycationic agent of claim 10, wherein n is 24. 

13. The polycationic agent of claim 10, wherein n is 18. 

14. The polycationic agent of claim 10, wherein n is 12. 

15. The polycationic agent of claim 8, wherein Ta and Tc are terminal 
groups selected from the group consisting of polypeptide, lipid, lipoprotein, vitamin, 
hormone, polyakylene glycol, and saccharide. 

16. A composition comprising: 

21^ 
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<i) a polynucleotide; and 

(ii) the polycationic agent of claim 1, wherein said polycationic agent is 
capable of mediating entry of polynucleotides into a cell. 

17. A phamiaceutical composition comprising: 

(i) a pharmaceutically acceptable carrier; 

(ii) a therapeutically effective amount of polynucleotides; and 

(iii) an amount effective to neutralize said polynucleotides of the polycationic 
agent of claim 1 wherein said polycationic agent is capable of mediating entry of 
polynucleotides into a cell. 

18. A method of complexing polynucleotides with a polycationic agent 
comprising: 

(i) providing a polynucleotide; and 

(ii) contacting said polynucleotide with the polycationic agent of claim 1 , 
wherein said polycationic agent is capable of mediating entry of polynucleotides into 
a cell. 

19. A method of condensing polynucleotides, said method comprising: 
contacting a polynucleotide with a condensing amount of the polycationic agent of 
claim 1 , wherein said condensing amount is an amount of polycationic agent 
sufficient to reduce the size of said polynucleotide. 

20. A method of inhibiting serum degradation of polynucleotides, said 
method comprising contacting a polynucleotide with the polycationic agent of claim 1 
wherein said polycationic agent is present in an amount effective to inhibit seram 
degradation by at least 10 minutes. 

21. A composition comprising : 

(i) a lipoprotein; 

(ii) a polynucleotide binding molecule; and 

(iii) a polynucleotide. 
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wherein said composition is capable of increasing the frequency of polynucleotide 
uptake into a cell. 

22. The composition of claim 21, wherein the lipoprotein is selected from 
the group consisting of high density lipoprotein, intermediate density lipoprotein, low 
density lipoprotein, and very low density lipoprotein. 

23. The composition of claim 21, wherein the lipoprotein a mutant, fragment 
or fusion of the protein selected from the group consisting of high density lipoprotein, 
intermediate density lipoprotein, low density lipoprotein, and very low density 
lipoprotein. 

24. The composition of claim 21, wherein the lipoprotein is acetylated low 
density lipoprotein. 

25. A pharmaceutical composition comprising 

(a) a therapeutically effective amount of a polynucleotide; 

(b) a polynucleotide binding molecule in an amount effective to neutralize 
the negative charge of said polynucleotide; and 

(c) a therapeutically effective amount of lipoprotein. 

26. The pharmaceutical composition of claim 26, wherein said 
polynucleotide is a polycationic agent. 

27- A method of producing a composition for facilitating entry of a 
polynucleotide into a cell said method comprising: 

(i) providing a polynucleotide 

(ii) providing a polynucleotide binding molecule in an amount effective to 
neutralize said polynucleotide; 

(ii) contacting said polynucleotide with said polynucleotide binding 
molecule to form a complex; 

(iii) providing a lipoprotein; then 

(iv) contacting the complex with said lipoprotein. 
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